LIMIT THEOREMS FOR BIFURCATING MARKOV CHAINS.
APPLICATION TO THE DETECTION 
OF CELLULAR AGING 

By Julien Guyon 
ENPC-CERMICS 

We propose a general method to study dependent data in a binary
tree, where an individual in one generation gives rise to two
different offspring, one of type 0 and one of type 1, in the next
generation. For any specific characteristic of these individuals, we
assume that the characteristic is stochastic and depends on its
ancestors only through the mother's characteristic. The dependency
structure may be described by a transition probability $P(x, dy\,dz)$
which gives the probability that the pair of daughters'
characteristics is around $(y, z)$, given that the mother's
characteristic is $x$. Note that $y$, the characteristic of the
daughter of type 0, and $z$, that of the daughter of type 1, may be
conditionally dependent given $x$, and their respective conditional
distributions may differ. We then speak of bifurcating Markov chains.

We derive laws of large numbers and central limit theorems for 
such stochastic processes. We then apply these results to detect cel- 
lular aging in Escherichia Coli, using the data of Stewart et al. and 
a bifurcating autoregressive model. 

1. Introduction. 

1.1. Motivation. This study has been motivated by a collaboration [13] 
with biologists from the Laboratoire de Genetique Moleculaire, Evolutive et 
Medicale (INSERM U571, Faculte de Medecine Necker, Paris). F. Taddei, 
E. J. Stewart, A. Lindner and G. Paul, together with R. Madden from the 
Institut des Hautes Etudes Scientifiques, have been working on Escherichia
Coli's aging. E. Coli is a single-celled model organism. It has been widely
studied by biologists, who have gathered a large amount of information
on its physiology. Whereas aging is obvious in macroscopic organisms, it is
not in single-celled ones, where, nevertheless, one has the best chances of
describing and quantifying the molecular process involved. It is especially
hard to identify in E. Coli, which reproduces without a juvenile phase and 
with an apparently symmetric division. Stewart et al. [22] have designed 
an experimental protocol which brings evidence of aging in E. Coli and we 
propose a statistical study of the data they collected. 

In this section we describe the biological experiment and present the data 
(Section 1.2). Between-experiment averaging shows a clear segregation be- 
tween the new- and old-pole derived progeny (see Section 1.2.1), whereas 
single-experiment data does not. In Section 1.3 we propose a linear Gaussian
model that allows us to study the populations of old and new poles
experiment-wise. The model consists of a bifurcating Markovian dynamics.
This motivates Section 2, where we give a detailed study of such stochastic
processes. We pay special attention to limit theorems such as laws of large
numbers (Theorems 8, 11 and 14) and central limit theorems (Theorem 19 and
its corollaries). Finally, in Section 3, we apply these results to the model,
proving strong laws of large numbers and a central limit theorem (see
Propositions 27 and 28), and derive rigorous estimation and test procedures,
which are performed on the data in order to detect cellular aging.

1.2. The biology. Here we briefly describe E. Coli's life cycle, the
experiment designed by Stewart et al. and the data they obtained. Figure 1
is taken from [22], where one can find further information.

1.2.1. The experiment. E. Coli is a rod-shaped bacterium. It reproduces 
by dividing in the middle, producing a new end per progeny cell (see Figure 
1). This new end is called the new pole, whereas the other end is pre-existing 
and is called the old pole. 

This defines an age in divisions for each pole, and hence, for each cell. One 
expects any cell component formed in the poles and with limited diffusion 
to accumulate at the old pole, so that there might be a physiological asym- 
metry between the old and new poles. To determine if E. Coli experiences 
aging related to the inheritance of the old pole, Stewart et al. followed 95 
individual exponentially growing cells through up to nine generations in an 
automated fluorescence microscopy system which allowed them to determine 
the complete lineage, the identity of each pole and, among other physical 
parameters, the growth rate of each cell. Let us now present their results. 

1.2.2. Original data. Each of the 95 films gives rise to a genealogical tree 
such as the ones in Figure 2. The new poles are the solid lines and the old 
poles the dashed lines. The y-axis shows the growth rate, whereas the
x-axis displays time in divisions. No reproductive asymmetry between the
progeny cells is visible to the naked eye. Note that the data is generally
not regular: some generations are not completely observed, and in a few
cases a cell's growth rate is measured whereas her sister's is not.

1.2.3. Averaged data. In order to eliminate the random effects which 
appear in Figure 2, Stewart et al. have averaged the 95 lineages by each 
unique cell position within the lineage. Figure 3 is the average tree thus 
produced for generations 5, 6 and 7. It clearly shows a segregation between
the new and old poles. The old poles grow slower than the new poles, which
is evidence for aging.

Fig. 1. The life cycle of E. Coli, from [22].

Fig. 2. Two single-experiment data trees (two films).

Fig. 3. The average data tree.

However, we would prefer to study each experiment separately, since we do 
not know if the experiments are independent and/or identically distributed. 
Indeed, two initial cells giving rise to two different films are actually taken 
from the same macrocolony, so that there might be a correlation between 
the experiments. Moreover, as shown in Figure 2, the range of values of the 
growth rate changes from film to film, probably due to a slight change in the 
experimental conditions. In the next section we propose a statistical model 
that allows us to study the populations of old and new poles experiment- 
wise. It also has the advantage of taking into account the structure of the 
dependencies within a lineage. To be precise, contrary to Stewart et al., we 
take the effect of the mother into account, and we will prove that this effect 
is important (see Remark 39). 

1.3. The mathematical model. In order to describe the dynamics of the
growth rate, let $X_i$ denote the growth rate of individual $i$ and let $n$
denote the mother of $2n$ (the new pole progeny cell) and $2n + 1$ (the old
pole progeny cell); see Figure 4. We propose the following Markovian model
with memory one: $X_1$, the ancestor's growth rate, has distribution $\nu$
and, for all $n \ge 1$,

\[
\begin{cases}
X_{2n} = \alpha_0 X_n + \beta_0 + \varepsilon_{2n},\\
X_{2n+1} = \alpha_1 X_n + \beta_1 + \varepsilon_{2n+1},
\end{cases}
\tag{1}
\]

where $\alpha_0, \alpha_1 \in (-1, 1)$, $\beta_0, \beta_1 \in \mathbb{R}$ and
$((\varepsilon_{2n}, \varepsilon_{2n+1}), n \ge 1)$ forms a sequence of
independent and identically distributed (i.i.d.) centered bivariate Gaussian
random variables (r.v.), say,
$(\varepsilon_{2n}, \varepsilon_{2n+1}) \sim \mathcal{N}_2(0, \Gamma)$ with

\[
\Gamma = \sigma^2 \begin{pmatrix} 1 & \rho \\ \rho & 1 \end{pmatrix},
\qquad \sigma^2 > 0, \quad \rho \in (-1, 1)
\]

($\varepsilon_{2n}$ and $\varepsilon_{2n+1}$ are thus supposed to have common
variance $\sigma^2$). We speak of memory one because a cell's growth rate is
explained by its mother's. For instance, a Markovian model with memory two
would also take into account the grandmother's growth rate. Considering
Markovian models with memory two would allow us to test whether the
grandmother effect is significant. In this article we concentrate on
model (1), which we regard as the simplest reasonable model describing a
dependency within the colony.

Fig. 4. The binary tree T.

Remark 1. Since a Gaussian r.v. may take arbitrarily large negative
values, the model allows the growth rate to be negative. However, provided
we correctly estimate the parameters, this should happen with extremely
small probability.
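As an illustration of model (1), the following sketch simulates the growth rates on the first generations of the tree. It is only a minimal sketch: the function name `simulate_bar` and the numerical parameter values are hypothetical choices made for the example, not estimates taken from the data, and the correlated noise is built from two independent standard Gaussians.

```python
import math
import random

def simulate_bar(alpha0, beta0, alpha1, beta1, sigma, rho, generations, x1, seed=0):
    """Simulate model (1) on generations 0..generations of the binary tree.

    Returns a list x with x[n] the growth rate of cell n (x[0] is unused),
    cell n being the mother of 2n (new pole) and 2n + 1 (old pole).
    """
    rng = random.Random(seed)
    x = [None] * 2 ** (generations + 1)     # labels 1 .. 2^(generations+1) - 1
    x[1] = x1
    for n in range(1, 2 ** generations):    # every cell that has daughters
        z0, z1 = rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)
        # Noise pair with common variance sigma^2 and correlation rho.
        e_new = sigma * z0
        e_old = sigma * (rho * z0 + math.sqrt(1.0 - rho ** 2) * z1)
        x[2 * n] = alpha0 * x[n] + beta0 + e_new
        x[2 * n + 1] = alpha1 * x[n] + beta1 + e_old
    return x

# Hypothetical parameters, nine generations below the initial cell.
x = simulate_bar(0.5, 1.0, 0.4, 0.8, sigma=0.1, rho=0.3, generations=9, x1=1.0)
```

With `sigma=0` the recursion becomes deterministic, which gives a quick sanity check of the indexing of new and old poles.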
We aim at the following:

(1) estimating the 4-dimensional parameter
$\theta = (\alpha_0, \beta_0, \alpha_1, \beta_1)$, $\rho$ and $\sigma^2$,

(2) testing the null hypothesis
$H_0 = \{(\alpha_0, \beta_0) = (\alpha_1, \beta_1)\}$ against its
alternative $H_1 = \{(\alpha_0, \beta_0) \ne (\alpha_1, \beta_1)\}$.

In view of the biological question addressed here, point (2) is crucial:
rejecting $H_0$ comes down to accepting that the dynamics of the growth rate
of the old pole offspring is different from that of the new pole offspring.
We shall actually see that the old pole progeny cell experiences a slowed
growth rate and, hence, should be considered an aging parent repeatedly
producing rejuvenated offspring.

Bifurcating autoregressive (BAR) models, such as model (1), have already
been studied. Cowan and Staudte [9] were pioneers and studied model (1) in
the special case when $(\alpha_0, \beta_0) = (\alpha_1, \beta_1)$, that is,
under $H_0$. Several extensions [2, 3, 4, 8, 14, 15, 16, 21] followed,
improving inference results and/or generalizing the model, but no
distinction was ever made between new and old poles. In mathematical terms,
in all these articles, the distribution of $X_{2n}$ given $X_n$ has always
been assumed to be the same as the distribution of $X_{2n+1}$ given $X_n$.
Now, detecting a discrepancy between these distributions is the central
question addressed here. Hence, model (1) generalizes existing BAR models
and allows us to detect dissymmetry between sisters. Such a generalization
is a source of mathematical difficulties. For instance, there is no
stationary distribution in the sense of [9], that is to say, a distribution
common to all cells in the clone. Besides, $(X_n, n \ge 1)$ does not
converge in distribution. For example, by computing characteristic
functions, it is easy to see that the sequence of always new poles' growth
rates $(X_{2^n}, n \ge 1)$ and that of always old poles' growth rates
$(X_{2^{n+1}-1}, n \ge 1)$ both converge in distribution, but, unless
$\alpha_0 = \alpha_1$ and
$\beta_0/(1 - \alpha_0) = \beta_1/(1 - \alpha_1)$, the (Gaussian) limit
distributions are distinct. This leads us to develop a new theory (see
Section 2). Note that in a recent work Evans and Steinsaltz [11] also
address the question of dissymmetry between sisters' growth rates, by
proposing a superprocess model for damage segregation and showing that
optimal growth is achieved by unequal division of (deterministically
accumulating) damage between the daughters.
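To make the comparison of the two limit laws concrete, here is a small sketch. It relies on a standard fact that is not spelled out in the text: along a lineage that always follows the same pole, the growth rate obeys a Gaussian autoregression of order one, $x \mapsto \alpha x + \beta + \varepsilon$, whose limit law is Gaussian with mean $\beta/(1 - \alpha)$ and variance $\sigma^2/(1 - \alpha^2)$. The helper name `ar1_limit` and the parameter values are hypothetical.

```python
def ar1_limit(alpha, beta, sigma2):
    """Mean and variance of the Gaussian limit law of x -> alpha * x + beta + noise."""
    if not -1.0 < alpha < 1.0:
        raise ValueError("alpha must lie in (-1, 1)")
    return beta / (1.0 - alpha), sigma2 / (1.0 - alpha ** 2)

# Hypothetical parameters: (alpha0, beta0) for new poles, (alpha1, beta1) for old poles.
new_pole_limit = ar1_limit(0.5, 1.0, 0.01)
old_pole_limit = ar1_limit(0.4, 0.8, 0.01)
limits_differ = new_pole_limit != old_pole_limit
```

For these values the two limit means are 2 and 4/3, so the always-new-pole and always-old-pole lineages settle on different Gaussian laws.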

We shall call $X = (X_n, n \ge 1)$ a bifurcating Markov chain. The next
section is devoted to the study of this family of stochastic processes.
Establishing laws of large numbers and central limit theorems will be
crucial in achieving the two objectives above. That is why we will pay
special attention to such limit theorems.
2. Bifurcating Markov chains. Limit theorems.

2.1. Definitions. Markov chains (MCs) are usually indexed by the inte- 
gers. Here we give a definition of a MC when the index set is the (regular) 
binary tree T. We then speak of a bifurcating Markov chain or a T-Markov 
chain, which we often write T-MC. T-MCs are well adapted to modeling 
data on the descent of an initial individual, where each individual in one 
generation gives rise to two offspring in the next one. Cell lineage data,
such as those presented in Section 1.2, are typically of this kind.

Let us introduce some notation about the binary tree $T$; see Figure 4.
Each vertex $n \in T$ is seen as a positive integer $n \in \mathbb{N}^*$. It
should be thought of as an individual or a cell. It has exactly two
daughters, $2n$ and $2n + 1$, and we label the root 1. We denote by
$G_q = \{2^q, 2^q + 1, \ldots, 2^{q+1} - 1\}$ the $q$th generation and by
$T_r = \bigcup_{q=0}^{r} G_q$ the subtree consisting of the first $r + 1$
generations. With this notation, $G_0 = \{1\}$ and, $|\cdot|$ standing for
the cardinality, $|G_q| = 2^q$ and $|T_r| = 2^{r+1} - 1$. We also denote by
$r_n = \lfloor \log_2 n \rfloor$ the generation of individual $n$, that is,
$n \in G_{r_n}$. In terms of labeling the vertices, $T$ is identified with
$\mathbb{N}^*$, but the topology is different: within $T$, $n$ and $2n$
(resp. $n$ and $2n + 1$) should be seen as neighbors.
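The labeling of the tree by positive integers translates directly into integer arithmetic. The helpers below are a minimal sketch with hypothetical names; they compute the generation of a vertex, the members of a generation, and the sequence of 0/1 steps (0 for a move to $2m$, 1 for a move to $2m + 1$) leading from the root to a vertex.

```python
def generation(n):
    """Generation floor(log2(n)) of a vertex n >= 1, computed exactly on integers."""
    return n.bit_length() - 1

def generation_members(q):
    """The q-th generation {2^q, ..., 2^(q+1) - 1} as a range."""
    return range(2 ** q, 2 ** (q + 1))

def path_from_root(n):
    """Steps from the root 1 to n: 0 for a move to 2m, 1 for a move to 2m + 1."""
    return [int(bit) for bit in bin(n)[3:]]   # drop the '0b' prefix and the leading 1

def subtree_size(r):
    """Number of vertices in the first r + 1 generations."""
    return sum(len(generation_members(q)) for q in range(r + 1))
```

For instance, vertex 25 lies in generation 4 and is reached from the root through the steps 1, 0, 0, 1, that is, through the vertices 1, 3, 6, 12, 25.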

Let $(S, \mathcal{S})$ be a metric space endowed with its Borel
$\sigma$-field, and think of it as the state space. For instance, in the BAR
model (1), $S = \mathbb{R}$. For any integer $p \ge 2$, we equip $S^p$ with
the product $\sigma$-field, say, $\mathcal{S}^p$.

Definition 2. We call T-transition probability any mapping
$P : S \times \mathcal{S}^2 \to [0, 1]$ such that:

• $P(\cdot, A)$ is measurable for all $A \in \mathcal{S}^2$,

• $P(x, \cdot)$ is a probability measure on $(S^2, \mathcal{S}^2)$ for all
$x \in S$.

We also define, for $x \in S$ and $B \in \mathcal{S}$,
$P_0(x, B) = P(x, B \times S)$ and $P_1(x, B) = P(x, S \times B)$. $P_0$ and
$P_1$ are transition probabilities on $(S, \mathcal{S})$. In the BAR
model (1), they respectively correspond to the transition probabilities of
the new poles and of the old poles.

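For $p \ge 1$, we denote by $\mathcal{B}(S^p)$ [resp. $\mathcal{B}_b(S^p)$,
$\mathcal{C}(S^p)$, $\mathcal{C}_b(S^p)$] the set of all
$\mathcal{S}^p$-measurable (resp. $\mathcal{S}^p$-measurable and bounded,
continuous, continuous and bounded) mappings $f : S^p \to \mathbb{R}$. If
$p \in \{2, 3\}$ and $f \in \mathcal{B}(S^p)$, when it is defined, we denote
by $Pf \in \mathcal{B}(S)$ the function

\[
x \mapsto Pf(x) =
\begin{cases}
\int_{S^2} f(y, z)\, P(x, dy\,dz) & \text{if } p = 2,\\
\int_{S^2} f(x, y, z)\, P(x, dy\,dz) & \text{if } p = 3.
\end{cases}
\tag{2}
\]

Let $(\Omega, \mathcal{F}, (\mathcal{F}_r, r \in \mathbb{N}), \mathbb{P})$ be
a filtered probability space and, defined on it, $(X_n, n \in T)$ be a
family of $S$-valued random variables. Let $\nu$ be a probability on
$(S, \mathcal{S})$ and $P$ be a T-transition probability.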
Definition 3. We say that $(X_n, n \in T)$ is a
$(\mathcal{F}_r)$-bifurcating Markov chain, or $(\mathcal{F}_r)$-T-MC (with
initial distribution $\nu$ and T-transition probability $P$), if:

• $X_n$ is $\mathcal{F}_{r_n}$-measurable for all $n \in T$,

• $X_1$ has distribution $\nu$,

• for all $q \in \mathbb{N}$ and for all families $(f_n, n \in G_q)$ in
$\mathcal{B}_b(S^3)$,

\[
\mathbb{E}\Big[ \prod_{n \in G_q} f_n(X_n, X_{2n}, X_{2n+1}) \,\Big|\,
\mathcal{F}_q \Big] = \prod_{n \in G_q} Pf_n(X_n).
\]

This means that, given generations 0 to $q$, that is, $T_q$, one builds
generation $G_{q+1}$ by drawing $2^q$ independent couples
$(X_{2n}, X_{2n+1})$ according to $P(X_n, \cdot)$ ($n \in G_q$). A
$(\mathcal{F}_r)$-T-MC is also a $(\mathcal{F}^X_r)$-T-MC, where
$\mathcal{F}^X_r = \sigma(X_i, i \in T_r)$. When unstated, the filtration
implicitly used is the latter. Note that for $f \in \mathcal{B}_b(S^2)$,
$\mathbb{E}[f(X_{2n}, X_{2n+1}) \mid \mathcal{F}_{r_n}]$ factorizes through
the random variable $X_n$, so that
$\mathbb{E}[f(X_{2n}, X_{2n+1}) \mid \mathcal{F}_{r_n}] =
\mathbb{E}[f(X_{2n}, X_{2n+1}) \mid X_n]$. This means that any $X_n$ depends
on past generations only through her mother. This explains why we speak of
a Markov chain (with memory one).

Last but not least, note that, contrary to much of the (still sparse)
literature on the subject, we allow conditional dependency between sisters.
Conditional independence corresponds to the case when $P$ factorizes as a
product $P_0 \otimes P_1$ of two transition probabilities on
$(S, \mathcal{S})$, that is,
$P(x, dy\,dz) = P_0(x, dy) \otimes P_1(x, dz)$ for all $x \in S$.
References [1, 5, 17, 18, 19, 23, 24] deal with trees that are more general
than binary, and possibly random, but all assume that, conditionally on
their mother's type, the daughters have independent and identically
distributed types. In our case, this corresponds to conditional
independence with $P_0 = P_1$. As said in Section 1.3, to the best of our
knowledge, BAR models, although they allow for conditional dependence, have
always been studied until now under the assumption that $P_0 = P_1$. Now,
detecting that $P_0 \ne P_1$ will be the central question when we study
cellular aging (see Section 3). Moreover, as in [1], we consider general
state spaces, whereas [5] deals only with countable ones and
[17, 18, 19, 23, 24] only with finite ones. Note that in the latter case,
one may regard a T-MC $X$ as a multitype branching process and apply
Maaouia and Touati's identification techniques [19].



2.2. Weak law of large numbers. 



2.2.1. Introduction. A first natural question is to know whether a
T-Markov chain $(X_n)$ obeys laws of large numbers (LLN), that is,
convergence of empirical means.

Given $f \in \mathcal{B}(S)$ and a finite subset $I \subset T$, let us write
$M_I(f) = \sum_{i \in I} f(X_i)$ and $\overline{M}_I(f) = |I|^{-1} M_I(f)$.
Several empirical averages can be considered:

• One may average over the $q$th generation, that is, compute
$\overline{M}_{G_q}(f)$.

• One may prefer to average over the first $r + 1$ generations, that is,
compute $\overline{M}_{T_r}(f)$. This is meaningful because $G_q$ naturally
precedes $G_{q+1}$, since one cannot draw the whole $(q + 1)$th generation
without having completely drawn the $q$th one.

• One may also average over the "first" $n$ individuals, that is, compute
$n^{-1} \sum_{i=1}^{n} f(X_i)$. However, there is no natural order within a
generation $G_q$: all the individuals $(X_n, n \in G_q)$ of the $q$th level
can be generated simultaneously. That is why we introduce the set
$\mathfrak{S}$ of all permutations of $\mathbb{N}^*$ that leave each $G_q$
invariant and, for $f \in \mathcal{B}(S)$ and $\pi \in \mathfrak{S}$,
consider the sums

\[
M^\pi_n(f) = \sum_{i=1}^{n} f(X_{\pi(i)}).
\]

As far as the asymptotic behavior of
$\overline{M}^\pi_n(f) = n^{-1} M^\pi_n(f)$ is concerned, the choice of
$\pi$ matters. This is because each new generation is essentially the same
size as the total of all previous generations. To illustrate this, consider
the following example.

Example 4. Assume that $S = \{0, 1\}$, $f = \mathrm{id}_S$ and, whatever the
mother's type, $X_{2n} = 1$ and $X_{2n+1} = 0$; in other words,
$P(x, dy\,dz) = \delta_1(dy)\,\delta_0(dz)$ for all $x \in S$, where
$\delta_x$ stands for the Dirac mass at point $x$. If $\pi \in \mathfrak{S}$
sends the "first half" of each $G_q$, that is,
$\{2^q, 2^q + 1, \ldots, 3 \cdot 2^{q-1} - 1\}$, onto the even elements in
$G_q$, that is, $G_q \cap (2\mathbb{N})$, then
$\liminf_{n\to\infty} \overline{M}^\pi_n(f) = 1/2$ and
$\limsup_{n\to\infty} \overline{M}^\pi_n(f) = 2/3$: the empirical average
$\overline{M}^\pi_n(f)$ oscillates between 1/2 and 2/3. Conversely, if $\pi$
sends the "first half" of each $G_q$ onto the odd elements in $G_q$, that
is, $G_q \cap (2\mathbb{N} + 1)$, then
$\liminf_{n\to\infty} \overline{M}^\pi_n(f) = 1/3$ and
$\limsup_{n\to\infty} \overline{M}^\pi_n(f) = 1/2$. But for
$\pi = \mathrm{id}_{\mathbb{N}^*}$, $\overline{M}^\pi_n(f)$ converges to 1/2.

• A natural answer to this issue is to explore each new generation "by
chance," that is, to draw a permutation $\Pi$ "uniformly" on
$\mathfrak{S}$, independently of $X = (X_n, n \in T)$. Drawing $\Pi$
"uniformly" on $\mathfrak{S}$ means drawing the restriction of $\Pi$ to
$G_q$ uniformly among the $(2^q)!$ permutations of $G_q$, independently for
each $q$. Then we consider the empirical average
$\overline{M}^\Pi_n(f) = n^{-1} M^\Pi_n(f)$, where

\[
M^\Pi_n(f) = \sum_{i=1}^{n} f(X_{\Pi(i)}).
\]

Thus, we introduce extra randomness, but this will allow us to verify
Liapunov's condition when we try to derive a central limit theorem for $X$.
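The role of the exploration order can be checked numerically on Example 4. The sketch below builds the deterministic tree of that example, the permutation that visits the even elements of each generation first, and a permutation drawn uniformly within each generation, and then computes the running averages. All function names are hypothetical, and the value 0 given to the root is an arbitrary choice, since the example leaves it unspecified.

```python
import random

def values_example4(r):
    """Values on the first r + 1 generations: daughters 2n equal 1, daughters 2n + 1 equal 0."""
    x = [0] * 2 ** (r + 1)        # x[0] unused; the root value x[1] = 0 is arbitrary
    for n in range(2, 2 ** (r + 1)):
        x[n] = 1 if n % 2 == 0 else 0
    return x

def evens_first(r):
    """Permutation sending the first half of each generation onto its even elements."""
    pi = [0, 1]                   # pi[0] unused, pi[1] = 1
    for q in range(1, r + 1):
        start = 2 ** q
        pi += list(range(start, 2 * start, 2)) + list(range(start + 1, 2 * start, 2))
    return pi

def uniform_on_generations(r, rng):
    """Shuffle each generation independently and uniformly."""
    pi = [0, 1]
    for q in range(1, r + 1):
        members = list(range(2 ** q, 2 ** (q + 1)))
        rng.shuffle(members)
        pi += members
    return pi

def running_averages(x, pi):
    """avg[n] = (1/n) * sum over i <= n of x[pi[i]], for n = 1 .. len(pi) - 1."""
    avg, total = [None], 0
    for n in range(1, len(pi)):
        total += x[pi[n]]
        avg.append(total / n)
    return avg

r = 12
x = values_example4(r)
adversarial = running_averages(x, evens_first(r))
randomized = running_averages(x, uniform_on_generations(r, random.Random(0)))
```

With the even-first order, the running average is close to 1/2 at the end of each generation and close to 2/3 halfway through the next one, whereas the uniformly shuffled order removes this systematic drift.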
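Remark 5. Note that for all $\pi \in \mathfrak{S}$ and $r \ge 0$,
$\overline{M}^\pi_{|T_r|}(f) = \overline{M}_{T_r}(f)$. Besides, for all
$r \ge 0$,

\[
\overline{M}_{T_r}(f) = \sum_{q=0}^{r} \frac{|G_q|}{|T_r|}\,
\overline{M}_{G_q}(f)
\tag{3}
\]

and, for all $\pi \in \mathfrak{S}$ and $n \ge 1$,

\[
\overline{M}^\pi_n(f) = \sum_{q=0}^{r_n - 1} \frac{|G_q|}{n}\,
\overline{M}_{G_q}(f) + \frac{1}{n} \sum_{i=2^{r_n}}^{n} f(X_{\pi(i)})
\tag{4}
\]

(we systematically use the convention that a sum over an empty set is zero).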
Because of the branching, empirical averages of T-MCs may not behave like
those of the corresponding MCs. Precisely, given a transition probability
$R$, a LLN may hold for MCs with transition probability $R$ and fail for
T-MCs with T-transition probability $R \otimes R$. A very simple but crucial
illustration of this is Example 6. At least for the case of finite state
spaces, let us keep in mind that periodicity is problematic and that there
is more to ask than irreducibility and recurrence for a T-MC to obey a LLN.

Example 6. Consider the two-state MC, say, $S = \{0, 1\}$, with
$R(0, \cdot) = \delta_1$ and $R(1, \cdot) = \delta_0$. For a MC $Y$, a LLN
holds true: whatever the initial state, the empirical average
$\frac{1}{n} \sum_{i=1}^{n} Y_i$ converges to 1/2 when $n$ grows to infinity.
But for the corresponding T-MC $X$ with T-transition probability
$P = R \otimes R$, that is, $P(0, \cdot) = \delta_1 \otimes \delta_1$ and
$P(1, \cdot) = \delta_0 \otimes \delta_0$, the empirical average over $T_r$
endlessly fluctuates between 1/3 and 2/3.
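Example 6 is deterministic once the root is fixed, so the fluctuation can be observed exactly. The following sketch (hypothetical function names, root value 0 chosen arbitrarily) compares the average along a single chain with the average over the first $r + 1$ generations of the tree.

```python
def chain_average(n, start=0):
    """Average of Y_1, ..., Y_n for the two-state chain that flips at every step."""
    y, total = start, 0
    for _ in range(n):
        y = 1 - y                  # one step of R
        total += y
    return total / n

def tree_example6(r, root=0):
    """Values on the first r + 1 generations: both daughters take the opposite state."""
    x = [None] * 2 ** (r + 1)
    x[1] = root
    for n in range(1, 2 ** r):
        x[2 * n] = x[2 * n + 1] = 1 - x[n]
    return x

def average_over_tree(x):
    return sum(x[1:]) / (len(x) - 1)

along_chain = chain_average(1000)
even_r = average_over_tree(tree_example6(10))
odd_r = average_over_tree(tree_example6(11))
```

Since each generation is about as large as all previous ones together, the last generation dominates the tree average, which is why it is near 1/3 for even $r$ and near 2/3 for odd $r$ when the root is 0.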

2.2.2. Results. Here we ask ourselves whether the various empirical
averages introduced in Section 2.2.1 converge, in quadratic mean, when the
size of the tree grows to infinity. We then speak of weak LLNs. A sufficient
condition for a weak LLN to hold appears to be the ergodicity (see
Definition 7) of the induced MC $(Y_r, r \in \mathbb{N})$, defined as
follows. Start from the root and recursively choose one of the two
daughters by tossing a balanced coin, independently of the T-MC $X$. In
mathematical terms, $Y_0 = X_1$ and, if $Y_r = X_n$, then
$Y_{r+1} = X_{2n + \zeta_{r+1}}$ for a sequence of independent balanced
Bernoulli r.v. $(\zeta_q, q \in \mathbb{N}^*)$ independent of $(X, \Pi)$.
Here, "balanced" means that
$\mathbb{P}(\zeta_q = 0) = \mathbb{P}(\zeta_q = 1) = 1/2$. It is easy to
check that $(Y_r, r \in \mathbb{N})$ is a MC with initial distribution $\nu$
and transition probability

\[
Q = \frac{P_0 + P_1}{2}.
\]
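The construction of the induced chain amounts to following one random lineage down the tree. A minimal sketch, assuming the tree values are stored in a list indexed by the vertex labels (the function name and the toy values are hypothetical):

```python
import random

def sample_induced_chain(x, r, rng):
    """Follow one random lineage: start at the root, then move to 2n or 2n + 1 with a fair coin."""
    n, labels = 1, [1]
    for _ in range(r):
        n = 2 * n + rng.randrange(2)   # 0: new pole daughter, 1: old pole daughter
        labels.append(n)
    return labels, [x[n] for n in labels]

r = 6
x = list(range(2 ** (r + 1)))          # toy tree in which the value of vertex n is n itself
labels, values = sample_induced_chain(x, r, random.Random(0))
```

The returned labels visit exactly one vertex per generation, each being a daughter of the previous one.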

Definition 7. We say that a MC $Y$ is ergodic if there exists a
probability $\mu$ on $(S, \mathcal{S})$ such that
$\lim_{r\to\infty} \mathbb{E}_x[f(Y_r)] = \int_S f \, d\mu$ for all
$x \in S$ and $f \in \mathcal{C}_b(S)$.

Then $\mu$ is the unique stationary distribution of $Y$, and the sequence
$(Y_r, r \in \mathbb{N})$ converges in distribution to $\mu$. Sufficient
conditions for ergodicity may be found in [6, 20]. We are now in a position
to state the main theorem of this section:

Theorem 8. Assume that the induced MC $(Y_r, r \in \mathbb{N})$ is ergodic,
with stationary distribution $\mu$. Then, for any $f \in \mathcal{C}_b(S)$,
the three empirical averages $\overline{M}_{G_q}(f)$,
$\overline{M}_{T_r}(f)$ and $\overline{M}^\Pi_n(f)$ converge to $(\mu, f)$
in $L^2$.

Remark 9. It is noteworthy that the asymptotic behavior of the three
empirical averages above depends on the T-transition probability $P$ only
through $Q = (P_0 + P_1)/2$.

Remark 10. Athreya and Kang [1] use an analogous ergodicity hypothesis to
get laws of large numbers. Namely, their results hold for Galton-Watson
trees in which particles move according to a Markov chain $R$ on
$(S, \mathcal{S})$, and they assume that $R^m(x, \cdot)$ converges to a
limit distribution as $m \to \infty$. If this happens uniformly in $x$ on
the compact subsets of $S$, they get a weak LLN; if this happens uniformly
in $x$ on $S$, they get a strong LLN. Observe that we do not assume any
uniformity in $x$ (but our tree is deterministic).

In the application (Section 3) the function $f$ will typically be
unbounded, so that we shall actually prove an extended version of
Theorem 8. To this end, let us first introduce some notation. We denote by:

• $i \wedge j$ the most recent common ancestor of $i, j \in T$,

• $f \otimes g$ the mapping $(x, y) \mapsto f(x) g(y)$,

• $Q^p$ the $p$th iterate of $Q$, recursively defined by the formulas
$Q^0(x, \cdot) = \delta_x$ and
$Q^{p+1}(x, B) = \int_S Q(x, dy)\, Q^p(y, B)$ for all $B \in \mathcal{S}$;
$Q^p$ is a transition probability on $(S, \mathcal{S})$,

• $\nu Q$ the distribution on $(S, \mathcal{S})$ defined by
$\nu Q(B) = \int_S \nu(dx)\, Q(x, B)$; $\nu Q^p$ is the law of $Y_p$,

• $(Qf)(x) = \int_S f(y)\, Q(x, dy)$ when it is defined,

• $\nu(f)$ or $(\nu, f)$ the integral $\int_S f \, d\nu$ when it is defined.

With this notation, for any distribution $\lambda$, transition probability
$Q$ and function $f \in \mathcal{B}(S)$ such that
$\lambda Q(|f|) < \infty$, we have $\lambda Q(f) = \lambda(Qf)$, which is
hence simply written $\lambda Q f$.

Now, let $F$ denote a subspace of $\mathcal{B}(S)$ such that:

(i) $F$ contains the constants,

(ii) $F^2 \subset F$,

(iii) $F \otimes F \subset L^1(P(x, \cdot))$ for all $x \in S$, and
$P(F \otimes F) \subset F$,

(iv) there exists a probability measure $\mu$ on $(S, \mathcal{S})$ such
that $F \subset L^1(\mu)$ and
$\lim_{r\to\infty} \mathbb{E}_x[f(Y_r)] = (\mu, f)$ for all $x \in S$ and
$f \in F$,

(v) for all $f \in F$, there exists $g \in F$ such that for all
$r \in \mathbb{N}$, $|Q^r f| \le g$,

(vi) $F \subset L^1(\nu)$,

where we have used the notation $F^2 = \{f^2 \mid f \in F\}$,
$F \otimes F = \{f \otimes g \mid f, g \in F\}$ and
$PE = \{Pf \mid f \in E\}$ whenever an operator $P$ acts on a set $E$. Note
that (i) and (iii) imply the condition

(iii') for $z = 0, 1$, $F \subset L^1(P_z(x, \cdot))$ for all $x \in S$ and
$P_z F \subset F$,

since $P_0 f = P(f \otimes 1)$ and $P_1 f = P(1 \otimes f)$. This in turn
implies

(iii'') $F \subset L^1(Q(x, \cdot))$ for all $x \in S$ and $QF \subset F$,

so that in (iv) and (v) $\mathbb{E}_x[f(Y_r)] = Q^r f(x)$ is well defined.
Note also that if $F$ contains enough functions, that is, if it contains
the set of all bounded Lipschitz functions, then $\mu$ is the unique
stationary distribution of $Y$, that is, $\mu Q = \mu$. The next theorem
states that the result in Theorem 8 remains true for $f$'s in such an $F$:

Theorem 11. Let $F$ satisfy conditions (i)-(vi) above. Then, for any
$f \in F$, the three empirical averages $\overline{M}_{G_q}(f)$,
$\overline{M}_{T_r}(f)$ and $\overline{M}^\Pi_n(f)$ converge to $(\mu, f)$
in $L^2$.

Obviously $F = \mathcal{C}_b(S)$ fulfills conditions (i)-(vi) as soon as $Y$
is ergodic, so that Theorem 11 implies Theorem 8. In Section 3 we shall
take $F$ to be the set of all continuous and polynomially growing functions.
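As a numerical illustration of this convergence for the unbounded function $f(x) = x$, the sketch below simulates the BAR model (1) generation by generation and compares the average over a late generation with the mean of the stationary law of $Q$. The function name and parameter values are hypothetical. The formula used for the stationary mean is our own derivation, not a formula quoted from the text: since $Qf(x) = \bar\alpha x + \bar\beta$ with $\bar\alpha = (\alpha_0 + \alpha_1)/2$ and $\bar\beta = (\beta_0 + \beta_1)/2$, stationarity forces the mean $m$ to solve $m = \bar\alpha m + \bar\beta$.

```python
import math
import random

def generation_means(alpha0, beta0, alpha1, beta1, sigma, rho, r, x1, seed=0):
    """Simulate model (1) and return the average growth rate of generations 0..r."""
    rng = random.Random(seed)
    current, means = [x1], [x1]
    for _ in range(r):
        nxt = []
        for x in current:
            z0, z1 = rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)
            # New pole daughter, then old pole daughter, with correlated noise.
            nxt.append(alpha0 * x + beta0 + sigma * z0)
            nxt.append(alpha1 * x + beta1
                       + sigma * (rho * z0 + math.sqrt(1.0 - rho ** 2) * z1))
        current = nxt
        means.append(sum(current) / len(current))
    return means

a0, b0, a1, b1 = 0.5, 1.0, 0.4, 0.8             # hypothetical parameters
means = generation_means(a0, b0, a1, b1, sigma=0.1, rho=0.3, r=14, x1=1.0)
# Stationary mean of Q: solution of m = a * m + b with averaged coefficients.
limit_mean = ((b0 + b1) / 2) / (1 - (a0 + a1) / 2)
```

The averages in `means` drift from the initial value toward `limit_mean`, which depends on the two poles only through the averaged coefficients.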

We shall also need an easy extension of Theorem 11 to the case when $f$
depends not only on an individual $X_i$, but on the mother-daughters
triangle $(X_i, X_{2i}, X_{2i+1})$. This will be useful in the application
(Section 3). Let us denote $\Delta_n = (X_n, X_{2n}, X_{2n+1})$ and, for
$f \in \mathcal{B}(S^3)$ and $I$ a finite subset of $T$,

\[
M_I(f) = \sum_{i \in I} f(\Delta_i) \quad \text{and} \quad
M^\Pi_n(f) = \sum_{i=1}^{n} f(\Delta_{\Pi(i)}).
\]

Then we have the following:

Theorem 12. Let $F$ satisfy conditions (i)-(vi). Let
$f \in \mathcal{B}(S^3)$ be such that $Pf$ and $Pf^2$ exist and belong to
$F$. Then the three empirical averages $\overline{M}_{G_q}(f)$,
$\overline{M}_{T_r}(f)$ and $\overline{M}^\Pi_n(f)$ converge to
$(\mu, Pf)$ in $L^2$.
2.2.3. Proofs. This section is devoted to the proofs of Theorems 11 and 12.

Proof of Theorem 11. Considering the function $f - (\mu, f)$ leaves us
with the case when $(\mu, f) = 0$. Then condition (iv) implies that

\[
\forall x \in S \qquad \lim_{r\to\infty} Q^r f(x) = 0.
\tag{5}
\]

We shall study the three empirical averages $\overline{M}_{G_q}(f)$,
$\overline{M}_{T_r}(f)$ and $\overline{M}^\Pi_n(f)$ successively.

Step 1. Let us first deal with $\overline{M}_{G_q}(f)$. First note that
$f(X_i) \in L^2$ for all $i \in G_q$. Indeed, there is a unique path
$(z_1, \ldots, z_q) \in \{0, 1\}^q$ in the binary tree from the root 1 to
$i$; here $(z_1, \ldots, z_q)$ should be seen as the realization of the coin
toss r.v. $(\zeta_1, \ldots, \zeta_q)$ that joins 1 to $i$. For instance,
$(1, 0, 0, 1)$ is the path from 1 to 25. Thus,

\[
\mathbb{E}[f^2(X_i)] = \nu P_{z_1} \cdots P_{z_q} f^2,
\]

which, from (ii), (iii') and (vi), is finite.

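Independently of $X$, let us now draw two independent indices $I_q$ and
$J_q$ uniformly from $G_q$. Then $f(X_{I_q}) f(X_{J_q}) \in L^1$ and we have

\[
\mathbb{E}[\overline{M}_{G_q}(f)^2] = \frac{1}{|G_q|^2}
\sum_{i, j \in G_q} \mathbb{E}[f(X_i) f(X_j)]
= \mathbb{E}[f(X_{I_q}) f(X_{J_q})].
\]

Let us fix $p \in \{0, \ldots, q\}$ and reason conditionally on the event
$\{I_q \wedge J_q \in G_p\}$. Then $I_q \wedge J_q$ is uniformly distributed
on $G_p$, so that $X_{I_q \wedge J_q}$ has the same distribution as $Y_p$,
that is, has distribution $\nu Q^p$. Besides, for $p < q$, conditionally on
the states $(X_{2(I_q \wedge J_q)}, X_{2(I_q \wedge J_q)+1})$ of the two
daughters of $I_q \wedge J_q$, $X_{I_q}$ and $X_{J_q}$ are independent and
have the same distribution as $Y_{q-p-1}$ with respective initial
conditions $X_{2(I_q \wedge J_q)}$ and $X_{2(I_q \wedge J_q)+1}$. Provided
we use the convention that $P(Q^{-1} f \otimes Q^{-1} f) = f^2$, we then
have

\[
\mathbb{E}[f(X_{I_q}) f(X_{J_q}) \mid I_q \wedge J_q \in G_p] =
\nu Q^p P(Q^{q-p-1} f \otimes Q^{q-p-1} f).
\tag{6}
\]

Now, $\mathbb{P}(I_q \wedge J_q \in G_q) = \mathbb{P}(I_q = J_q) = 2^{-q}$
and, for $p \in \{0, \ldots, q-1\}$,
$\mathbb{P}(I_q \wedge J_q \in G_p) = 2^{-p-1}$. Indeed, since $I_q$ and
$J_q$ are independent, the paths $(\zeta^I_1, \ldots, \zeta^I_q)$ from 1 to
$I_q$ and $(\zeta^J_1, \ldots, \zeta^J_q)$ from 1 to $J_q$ are independent,
so that for $p \in \{0, \ldots, q-1\}$,

\[
\mathbb{P}(I_q \wedge J_q \in G_p) = \mathbb{P}(\zeta^I_1 = \zeta^J_1,
\ldots, \zeta^I_p = \zeta^J_p, \zeta^I_{p+1} \ne \zeta^J_{p+1}) = 2^{-p-1}.
\]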
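In short, we write
$\mathbb{P}(I_q \wedge J_q \in G_p) = 2^{-p - 1_{\{p < q\}}}$. Combined with
(6), this finally gives

\[
\mathbb{E}[\overline{M}_{G_q}(f)^2] = \mathbb{E}[f(X_{I_q}) f(X_{J_q})]
= \sum_{p=0}^{q} 2^{-p - 1_{\{p < q\}}}\,
\nu Q^p P(Q^{q-p-1} f \otimes Q^{q-p-1} f).
\tag{7}
\]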
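The law of the generation of the most recent common ancestor of two independent uniform indices can be verified by exhaustive enumeration on a small generation. The sketch below (hypothetical function names) computes it exactly with rational arithmetic.

```python
from fractions import Fraction

def mrca(i, j):
    """Most recent common ancestor in the tree where n is the mother of 2n and 2n + 1."""
    while i != j:
        if i > j:
            i //= 2        # the larger label is at least as deep: move it up
        else:
            j //= 2
    return i

def mrca_generation_law(q):
    """Exact law of the generation of the common ancestor of two uniform indices of generation q."""
    members = range(2 ** q, 2 ** (q + 1))
    counts = [0] * (q + 1)
    for i in members:
        for j in members:
            counts[mrca(i, j).bit_length() - 1] += 1
    total = len(members) ** 2
    return [Fraction(c, total) for c in counts]

law = mrca_generation_law(5)
```

For $q = 5$ the enumeration returns the probabilities 1/2, 1/4, 1/8, 1/16, 1/32 for generations 0 to 4 and 1/32 for generation 5, the last value being the probability that the two indices coincide.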

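Let us now fix $\varepsilon > 0$ and choose $p_\varepsilon \in \mathbb{N}$
such that $2^{-p_\varepsilon} < \varepsilon$. Then
$\sum_{p > p_\varepsilon} 2^{-p} < \varepsilon$. Besides, from (iii), (v)
and (vi), there is a $c_f > 0$ such that

\[
|\nu Q^p P(Q^{q-p-1} f \otimes Q^{q-p-1} f)| \le c_f
\tag{8}
\]

for all $0 \le p \le q$. Hence, for $q > p_\varepsilon$,

\[
\mathbb{E}[\overline{M}_{G_q}(f)^2] \le \varepsilon c_f +
\sum_{p=0}^{p_\varepsilon} |\nu Q^p P(Q^{q-p-1} f \otimes Q^{q-p-1} f)|.
\tag{9}
\]

From (v), there exists $g \in F$ such that, for all $r \in \mathbb{N}$,
$|Q^r f \otimes Q^r f| \le g \otimes g$. From (iii), $P(g \otimes g) \in F$
so that, using (v) and (vi),
$P(g \otimes g) \in \bigcap_{p \in \mathbb{N}} L^1(\nu Q^p)$. This shows
that the sequence of functions $(Q^r f \otimes Q^r f, r \in \mathbb{N})$ is
dominated by
$g \otimes g \in \bigcap_{p \in \mathbb{N}} L^1(\nu Q^p P)$. Then (5) and
Lebesgue's dominated convergence theorem imply that

\[
\forall p \in \mathbb{N} \qquad
\lim_{r\to\infty} \nu Q^p P(Q^r f \otimes Q^r f) = 0.
\tag{10}
\]

As a consequence, the r.h.s. of (9) converges to $\varepsilon c_f$ as $q$
grows to infinity. Since $\varepsilon$ is arbitrary, the proof is complete
for $\overline{M}_{G_q}(f)$.

Convergence results for $\overline{M}_{T_r}(f)$ or $\overline{M}^\Pi_n(f)$
may be easily deduced from those for $\overline{M}_{G_q}(f)$ by using (3) or
(4) and the following lemma.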

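Lemma 13. Let $(u_r, r \in \mathbb{N})$ be a sequence of nonnegative real
numbers converging to 0. Let

\[
v_r = \sum_{q=0}^{r} \frac{|G_q|}{|T_r|}\, u_q \quad \text{and} \quad
a_n = \sum_{q=0}^{r_n - 1} \frac{|G_q|}{n}\, u_q.
\]

Then $(v_r, r \in \mathbb{N})$ and $(a_n, n \in \mathbb{N}^*)$ converge to 0.

Proof. Let us fix $\varepsilon > 0$. We can find
$q_\varepsilon \in \mathbb{N}^*$ such that $u_q < \varepsilon$ for all
$q \ge q_\varepsilon$. Letting $M = \sup_{q \in \mathbb{N}} u_q$, we then
have, for all $r > q_\varepsilon$,
$v_r \le \varepsilon + M \sum_{q=0}^{q_\varepsilon - 1} |G_q| / |T_r|$. The
r.h.s. tends to $\varepsilon$ as $r$ grows to infinity, so that
$\limsup_{r\to\infty} v_r \le \varepsilon$; since $\varepsilon$ is
arbitrary, $\lim_{r\to\infty} v_r = 0$.

As for $(a_n, n \in \mathbb{N}^*)$, it is enough to notice that, since
$|T_{r_n - 1}| < n$,
$a_n = v_{r_n - 1} |T_{r_n - 1}| / n \le v_{r_n - 1}$ and to apply the
result for $(v_r, r \in \mathbb{N})$. □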
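The lemma says that averaging a vanishing sequence with weights proportional to the generation sizes still yields a vanishing sequence, even though the weights concentrate on the most recent generations. A small numerical sketch, with an arbitrary test sequence and a hypothetical function name:

```python
def weighted_generation_average(u, r):
    """Average of u(0), ..., u(r) with weights 2^q / (2^(r+1) - 1), the relative generation sizes."""
    size_T = 2 ** (r + 1) - 1
    return sum(2 ** q * u(q) for q in range(r + 1)) / size_T

def u(q):
    return 1.0 / (q + 1)             # an arbitrary nonnegative sequence tending to 0

v = [weighted_generation_average(u, r) for r in range(0, 61, 10)]
```

The values in `v` decrease toward 0, at a speed comparable to that of `u` itself since roughly half of the weight sits on the last generation.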
Step 2. Let us now treat $\overline{M}_{T_r}(f)$. From (3), we have by the
triangle inequality
$\|\overline{M}_{T_r}(f)\|_{L^2} \le \sum_{q=0}^{r} \frac{|G_q|}{|T_r|}
\|\overline{M}_{G_q}(f)\|_{L^2}$. From Step 1, $\overline{M}_{G_q}(f)$
converges to 0 in quadratic mean. Lemma 13 implies that the r.h.s. tends to
0, which ends the proof for $\overline{M}_{T_r}(f)$.

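Step 3. Finally, let us look at $\overline{M}^\Pi_n(f)$. From (4) and the
triangle inequality, $\|\overline{M}^\Pi_n(f)\|_{L^2} \le a_n + b_n$, where

\[
a_n = \sum_{q=0}^{r_n - 1} \frac{|G_q|}{n} \|\overline{M}_{G_q}(f)\|_{L^2}
\quad \text{and} \quad
b_n = \frac{1}{n} \Big\| \sum_{i=2^{r_n}}^{n} f(X_{\Pi(i)}) \Big\|_{L^2}.
\]

Since $\overline{M}_{G_q}(f)$ converges to 0 in quadratic mean, Lemma 13
implies that $\lim_{n\to\infty} a_n = 0$. As for $b_n$, first note that since
each $f(X_i)$ belongs to $L^2$, $f(X_{\Pi(i)}) f(X_{\Pi(j)}) \in L^1$ for all
$i, j \in \{2^{r_n}, \ldots, n\}$ so that

\[
b_n^2 = \frac{1}{n^2} \sum_{i, j = 2^{r_n}}^{n}
\mathbb{E}[f(X_{\Pi(i)}) f(X_{\Pi(j)})].
\]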
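Let us compute the latter expectation, depending on whether $i = j$ or
$i \ne j$. For all $i \in \{2^{r_n}, \ldots, n\}$, $\Pi(i)$ has the uniform
distribution on $G_{r_n}$, so that when $i = j$,
$\mathbb{E}[f(X_{\Pi(i)}) f(X_{\Pi(j)})] = \mathbb{E}[f^2(X_{\Pi(i)})] =
\nu Q^{r_n} f^2$. Let us now treat the case when $i \ne j$. Then
$r_n \ge 1$. Independently of $(X, \Pi)$, draw two independent indices
$I_{r_n}$ and $J_{r_n}$ uniformly from $G_{r_n}$. Then, since $i \ne j$, the
law of $(\Pi(i), \Pi(j))$ is the conditional law of $(I_{r_n}, J_{r_n})$
given $\{I_{r_n} \ne J_{r_n}\}$, so that

\begin{align*}
\mathbb{E}[f(X_{\Pi(i)}) f(X_{\Pi(j)})]
&= \mathbb{E}[f(X_{I_{r_n}}) f(X_{J_{r_n}}) 1_{\{I_{r_n} \ne J_{r_n}\}}]
   / \mathbb{P}(I_{r_n} \ne J_{r_n}) \\
&= (1 - 2^{-r_n})^{-1}\,
   \mathbb{E}[f(X_{I_{r_n}}) f(X_{J_{r_n}}) 1_{\{I_{r_n} \ne J_{r_n}\}}] \\
&= (1 - 2^{-r_n})^{-1} \big( \mathbb{E}[f(X_{I_{r_n}}) f(X_{J_{r_n}})]
   - \mathbb{E}[f^2(X_{I_{r_n}}) 1_{\{I_{r_n} = J_{r_n}\}}] \big) \\
&= (1 - 2^{-r_n})^{-1} \big( \mathbb{E}[f(X_{I_{r_n}}) f(X_{J_{r_n}})]
   - \mathbb{E}[f^2(X_{I_{r_n}})]\, \mathbb{P}(I_{r_n} = J_{r_n}) \big) \\
&= (1 - 2^{-r_n})^{-1} \big( \mathbb{E}[f(X_{I_{r_n}}) f(X_{J_{r_n}})]
   - 2^{-r_n} \nu Q^{r_n} f^2 \big) \\
&= (1 - 2^{-r_n})^{-1} \sum_{p=0}^{r_n - 1} 2^{-p-1}\,
   \nu Q^p P(Q^{r_n-p-1} f \otimes Q^{r_n-p-1} f),
\end{align*}

where we have used $\mathbb{P}(I_{r_n} = J_{r_n}) = 2^{-r_n}$ in the second
and fifth equalities, the independence of $(X, I_{r_n})$ and $J_{r_n}$ in
the fourth one, the fact that $X_{I_{r_n}}$ has the same distribution as
$Y_{r_n}$ in the fifth one and (7) with $q = r_n$ in the last one. We have
thus proved that $b_n^2 = b'_n + b''_n$ with

\[
b'_n = \frac{n - 2^{r_n} + 1}{n^2}\, \nu Q^{r_n} f^2
\quad \text{and} \quad
b''_n = \frac{(n - 2^{r_n})(n - 2^{r_n} + 1)}{n^2 (1 - 2^{-r_n})}
\sum_{p=0}^{r_n - 1} 2^{-p-1}\,
\nu Q^p P(Q^{r_n-p-1} f \otimes Q^{r_n-p-1} f).
\]

Since $(n - 2^{r_n} + 1)/n^2 \le 1/n$, and using (ii), (v) and (vi),
$\lim_{n\to\infty} b'_n = 0$. As for $b''_n$, let us fix $\varepsilon > 0$
and choose $p_\varepsilon \in \mathbb{N}^*$ such that
$2^{-p_\varepsilon} < \varepsilon$. From (8), there is a $c_f > 0$ such that
$|\nu Q^p P(Q^{r_n-p-1} f \otimes Q^{r_n-p-1} f)| \le c_f$ for all $p$ and
$n$ such that $r_n > p$. Since
$(n - 2^{r_n})(n - 2^{r_n} + 1)/(n^2 (1 - 2^{-r_n})) \le 1$, we then have

\[
|b''_n| \le \varepsilon c_f + \sum_{p=0}^{p_\varepsilon - 1}
|\nu Q^p P(Q^{r_n-p-1} f \otimes Q^{r_n-p-1} f)|.
\]

Now, using (10), we get that each term of the latter finite sum tends to 0
as $n$ tends to infinity, so that finally $\lim_{n\to\infty} b''_n = 0$,
which completes the proof. □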

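Proof of Theorem 12. Considering the function $f - (\mu, Pf)$ leaves us
with the case when $(\mu, Pf) = 0$. Let us treat the case of
$\overline{M}_{G_q}(f)$. Observe that $f(\Delta_i) \in L^2$ for all
$i \in G_q$. Indeed, $(z_1, \ldots, z_q)$ denoting the path from the root 1
to $i$ in the tree,
$\mathbb{E}[f(\Delta_i)^2] = \nu P_{z_1} \cdots P_{z_q} P f^2$, which is
finite from (iii') and (vi), since $Pf^2 \in F$. Thus, by conditioning on
$\mathcal{F}_q$,
$\mathbb{E}[f(\Delta_i) f(\Delta_j)] = \mathbb{E}[Pf(X_i) Pf(X_j)]$ for all
$i \ne j \in G_q$, and $\mathbb{E}[f^2(\Delta_i)] = \mathbb{E}[Pf^2(X_i)]$
for all $i \in G_q$. Hence,

\begin{align*}
\mathbb{E}[M_{G_q}(f)^2]
&= \sum_{i, j \in G_q} \mathbb{E}[f(\Delta_i) f(\Delta_j)] \\
&= \sum_{i \in G_q} \mathbb{E}[Pf^2(X_i)]
   + \sum_{i \ne j \in G_q} \mathbb{E}[Pf(X_i) Pf(X_j)] \\
&= \mathbb{E}[M_{G_q}(Pf)^2] + \mathbb{E}[M_{G_q}(Pf^2 - (Pf)^2)],
\end{align*}

so that

\[
\mathbb{E}[\overline{M}_{G_q}(f)^2] =
\mathbb{E}[\overline{M}_{G_q}(Pf)^2] +
\frac{\mathbb{E}[\overline{M}_{G_q}(Pf^2 - (Pf)^2)]}{|G_q|}.
\]

We can apply Theorem 11 twice:
$\lim_{q\to\infty} \mathbb{E}[\overline{M}_{G_q}(Pf^2 - (Pf)^2)] =
(\mu, Pf^2 - (Pf)^2)$ and
$\lim_{q\to\infty} \mathbb{E}[\overline{M}_{G_q}(Pf)^2] = 0$, so that
$\lim_{q\to\infty} \mathbb{E}[\overline{M}_{G_q}(f)^2] = 0$, that is,
$\overline{M}_{G_q}(f)$ converges to 0 in $L^2$. Using Lemma 13, we extend
this result to $\overline{M}_{T_r}(f)$. The proof for
$\overline{M}^\Pi_n(f)$ is similar to Step 3 of the proof of Theorem 11,
with the same extra conditioning argument as above. □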
2.3. Strong law of large numbers. So far, we have proved the weak LLN,
that is, convergence in quadratic mean for empirical averages. We now seek
a strong LLN. Theorem 14 gives sufficient conditions under which the
empirical averages over the $q$th generation and over the first $r + 1$
generations converge to a constant with probability one.

Theorem 14. Let $F$ satisfy conditions (i)-(vi). Let $f \in F$ be such that
$(\mu, f) = 0$. Assume that there exists $h \in F$ such that

\[
P\Big( \sum_{r \in \mathbb{N}} |Q^r f \otimes Q^r f| \Big) \le h.
\]

Then $\overline{M}_{G_q}(f)$ and $\overline{M}_{T_r}(f)$ almost surely
converge to 0 as $q \to \infty$ and $r \to \infty$, respectively.

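Proof. Step 1. Let us first treat $\overline{M}_{G_q}(f)$. Let us write
$\eta_q = \mathbb{E}[\overline{M}_{G_q}(f)^2]$. It is enough to check that
$\sum_{q \in \mathbb{N}} \eta_q < \infty$. Now, using (7) and Fubini's
theorem, we have

\begin{align*}
\sum_{q \in \mathbb{N}} \eta_q
&= \sum_{q \in \mathbb{N}} \sum_{p=0}^{q} 2^{-p - 1_{\{p < q\}}}\,
   \nu Q^p P(Q^{q-p-1} f \otimes Q^{q-p-1} f) \\
&= \sum_{p \in \mathbb{N}} \sum_{q=p}^{\infty} 2^{-p - 1_{\{p < q\}}}\,
   \nu Q^p P(Q^{q-p-1} f \otimes Q^{q-p-1} f) \\
&\le \sum_{p \in \mathbb{N}} 2^{-p}\, \nu Q^p \Big( f^2 +
   P\Big( \sum_{r \in \mathbb{N}} |Q^r f \otimes Q^r f| \Big) \Big) \\
&\le \sum_{p \in \mathbb{N}} 2^{-p}\, \nu Q^p (f^2 + h),
\end{align*}

which, from (v) and (vi), is finite, since $f^2 + h \in F$.

Step 2. Let us now deal with $\overline{M}_{T_r}(f)$. From (3), we have
$|\overline{M}_{T_r}(f)| \le \sum_{q=0}^{r} \frac{|G_q|}{|T_r|}
|\overline{M}_{G_q}(f)|$. From Step 1, a.s.
$\lim_{q\to\infty} |\overline{M}_{G_q}(f)| = 0$. It is enough to apply
Lemma 13 to get that $\overline{M}_{T_r}(f)$ a.s. converges to 0. □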

In particular, we have the following: 

Corollary 15. Let $F$ satisfy conditions (i)-(vi). Let $f \in F$ such that $(\mu, f) = 0$. Assume there exists $c \in F$ and a nonnegative sequence $(\kappa_r, r \in \mathbb{N})$ such that $\sum_{r \in \mathbb{N}} \kappa_r^2 < \infty$ and

\[
\forall x \in S,\ \forall r \in \mathbb{N} \qquad |Q^r f(x)| \le c(x)\, \kappa_r.
\]

Then $\overline{M}_{\mathbb{G}_q}(f)$ and $\overline{M}_{\mathbb{T}_r}(f)$ almost surely converge to 0.

Proof. From (iii), $h = (\sum_{r \in \mathbb{N}} \kappa_r^2)\, P(c \otimes c) \in F$ and $P(\sum_{r \in \mathbb{N}} |Q^r f \otimes Q^r f|) \le h$, so that Theorem 14 gives the result. □

Remark 16. In the case when $\kappa_r = \kappa^r$ for some $\kappa \in (0, 1)$, we speak of geometric ergodicity. Geometric ergodicity implies the almost sure convergence of $\overline{M}_{\mathbb{G}_q}(f)$ and $\overline{M}_{\mathbb{T}_r}(f)$.

Remark 17. Assume that the state space $S$ is finite, and that the induced MC $Y$ is irreducible and aperiodic. Then $Y$ is ergodic and, $\mu$ standing for its stationary distribution, the sequence of functions $(Q^r g, r \in \mathbb{N})$ uniformly converges to $(\mu, g)$ with exponential speed. Taking $F = \mathcal{B}(S)$ and $f = g - (\mu, g)$, Corollary 15 applies: $\overline{M}_{\mathbb{G}_q}(f)$ and $\overline{M}_{\mathbb{T}_r}(f)$ almost surely converge to 0, that is, $\overline{M}_{\mathbb{G}_q}(g)$ and $\overline{M}_{\mathbb{T}_r}(g)$ almost surely converge to $(\mu, g)$. This covers the main result in [18, 23, 24] when applied to the binary tree.

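In the case when $f$ depends on the mother-daughters triangle $\Delta_n = (X_n, X_{2n}, X_{2n+1})$, we can prove as well the following:

Theorem 18. Let $F$ satisfy conditions (i)-(vi). Let $f \in \mathcal{B}(S^3)$ such that $Pf$ and $Pf^2$ exist and belong to $F$, with $(\mu, Pf) = 0$.

(i) Assume that there exists $h \in F$ such that $P(\sum_{r \in \mathbb{N}} |Q^r Pf \otimes Q^r Pf|) \le h$. Then $\overline{M}_{\mathbb{G}_q}(f)$ and $\overline{M}_{\mathbb{T}_r}(f)$ almost surely converge to 0.

(ii) In particular, if there exists $c \in F$ and a nonnegative sequence $(\kappa_r, r \in \mathbb{N})$ such that $\sum_{r \in \mathbb{N}} \kappa_r^2 < \infty$ and

\[
\forall x \in S,\ \forall r \in \mathbb{N} \qquad |Q^r Pf(x)| \le c(x)\, \kappa_r,
\]

then $\overline{M}_{\mathbb{G}_q}(f)$ and $\overline{M}_{\mathbb{T}_r}(f)$ almost surely converge to 0.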

2.4. Central limit theorem. We are now interested in proving a central limit theorem (CLT) for the $\mathbb{T}$-MC $(X_n)$. This will be done by using a CLT for martingales.

Theorem 19. Let $F$ satisfy (i)-(vi). Let $f \in \mathcal{B}(S^3)$ such that $Pf^2$ and $Pf^4$ exist and belong to $F$. Assume that $Pf = 0$. Then $n^{-1/2} M^{\Pi}_n(f)$ converges in distribution to the Gaussian law $\mathcal{N}(0, s^2)$, where $s^2 = (\mu, Pf^2)$.

Proof. Let $M^{\Pi}_0(f) = 0$, $\mathcal{H}_0 = \sigma(X_1)$ and $\mathcal{H}_n = \sigma(\Delta_{\Pi(i)}, \Pi(i+1), 1 \le i \le n)$ for all $n \ge 1$. Note that $X_{\Pi(i)}$ is $\mathcal{H}_{i-1}$-measurable and that, conditionally on $\mathcal{H}_{i-1}$, $\Delta_{\Pi(i)}$ has distribution $\delta_{X_{\Pi(i)}} \otimes P(X_{\Pi(i)}, \cdot)$. Since $Pf = 0$, $(M^{\Pi}_n(f), n \ge 0)$ is a $(\mathcal{H}_n)$-martingale. It has bracket

\[
\langle M^{\Pi}(f) \rangle_n = \sum_{i=1}^{n} \mathbb{E}[f^2(\Delta_{\Pi(i)}) \mid \mathcal{H}_{i-1}] = \sum_{i=1}^{n} Pf^2(X_{\Pi(i)}) = M^{\Pi}_n(Pf^2).
\]

According to Theorem 11, since $Pf^2 \in F$, the sequence $n^{-1} \langle M^{\Pi}(f) \rangle_n$ converges to $(\mu, Pf^2) = s^2$ in $L^2$, and thus, in probability. It remains to check Liapunov's condition, say, for the fourth moment, that is, to prove that the sequence of positive r.v. $(L_n, n \ge 1)$ defined by

\[
L_n = \frac{1}{n^2} \sum_{i=1}^{n} \mathbb{E}[f^4(\Delta_{\Pi(i)}) \mid \mathcal{H}_{i-1}]
\]

tends in probability to 0 (see, e.g., [10]). But $L_n = \overline{M}^{\Pi}_n(Pf^4)/n$ and $\overline{M}^{\Pi}_n(Pf^4)$ converges to $(\mu, Pf^4)$ in quadratic mean, so that $L_n$ converges to 0 in probability. □

In the general case when $Pf \neq 0$, we have:

Corollary 20. Let $F$ satisfy (i)-(vi). Let $f \in \mathcal{B}(S^3)$ such that $Pf$, $Pf^2$ and $Pf^4$ exist and belong to $F$. Then $n^{-1/2}(M^{\Pi}_n(f) - M^{\Pi}_n(Pf))$ converges in distribution to $\mathcal{N}(0, s^2)$, where $s^2 = (\mu, Pf^2) - (\mu, (Pf)^2)$.

Proof. It is enough to apply Theorem 19 to the function $g$ defined by $g(x, y, z) = f(x, y, z) - Pf(x)$. □

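Considering the subsequence of indices $n = |\mathbb{T}_r|$, $r \in \mathbb{N}$, we can state the following:

Corollary 21. Let $F$ satisfy (i)-(vi). Let $f \in \mathcal{B}(S^3)$ such that $Pf$, $Pf^2$ and $Pf^4$ exist and belong to $F$. Then $|\mathbb{T}_r|^{1/2}(\overline{M}_{\mathbb{T}_r}(f) - \overline{M}_{\mathbb{T}_r}(Pf))$ converges in distribution to $\mathcal{N}(0, s^2)$, where $s^2 = (\mu, Pf^2) - (\mu, (Pf)^2)$.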

If we take $F$ to be $C_b(S)$, we get the following:

Corollary 22. Assume that the induced MC $(Y_r, r \in \mathbb{N})$ is ergodic, with stationary distribution $\mu$ (see Definition 7). Then, for any $f \in C_b(S^3)$, $n^{-1/2}(M^{\Pi}_n(f) - M^{\Pi}_n(Pf))$ converges in distribution to $\mathcal{N}(0, s^2)$, where $s^2 = (\mu, Pf^2) - (\mu, (Pf)^2)$.

Remark 23. Note that the normalizing factor is the square root of the number of individuals, $n$ or $|\mathbb{T}_r|$, and not the square root of the number of generations, $r_n$ or $r$, as one might have thought. Convergence is fast with $r$: with 20 generations ($r = 19$), the normalizing factor $|\mathbb{T}_r|^{1/2}$ is approximately $10^3$.
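
As a quick arithmetic check of the last figure, the following sketch computes $|\mathbb{T}_r| = 2^{r+1} - 1$ (one root, then a doubling at each generation) and its square root for $r = 19$. The function name is ours and is only illustrative.

```python
import math


def normalizing_factor(r: int) -> float:
    """Square root of |T_r| = 2^(r+1) - 1, the number of cells in generations 0..r."""
    return math.sqrt(2 ** (r + 1) - 1)


# 20 generations correspond to r = 19.
factor = normalizing_factor(19)
```

The value is within one unit of $2^{10} = 1024$, in line with the order of magnitude $10^3$ quoted in the remark.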

Using characteristic functions, it is easy to generalize Corollary 22 to the case when $f$ is vector-valued:

Corollary 24. Let $F$ satisfy (i)-(vi). Let $f_1, \ldots, f_d \in \mathcal{B}(S^3)$ such that $Pf_i$, $P(f_i f_j)$ and $P(f_i f_j f_k f_l)$ exist and belong to $F$ for all $i, j, k, l$. Let $f = (f_1, \ldots, f_d)$. Then $n^{-1/2}(M^{\Pi}_n(f) - M^{\Pi}_n(Pf))$ converges in distribution to the $d$-dimensional Gaussian law $\mathcal{N}_d(0, \Sigma)$, where $\Sigma_{ij} = (\mu, P(f_i f_j)) - (\mu, Pf_i\, Pf_j)$.

3. Detection of cellular aging. 

3.1. Limit theorems in the BAR model (1). Here we seek to apply the 
results in Section 2 to model (1). 

3.1.1. Weak law of large numbers and central limit theorem. In this section we take $F$ to be the set $C_{\mathrm{pol}}(\mathbb{R})$ of continuous and polynomially growing functions, that is, the set of all continuous functions $f : \mathbb{R} \to \mathbb{R}$ such that there exists $c \ge 0$ and $m \in \mathbb{N}$ such that, for all $x \in \mathbb{R}$,

\[
|f(x)| \le c(1 + |x|^m).
\]

In order to apply Theorems 11 and 12 and Corollary 24, we need to check 
conditions (i)-(vi). Conditions (i) and (ii) are obvious. The next lemma 
states that condition (iii) is fulfilled too. 

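Lemma 25. Let $f, g \in C_{\mathrm{pol}}(\mathbb{R})$. Then $f \otimes g \in L^1(P(x, \cdot))$ and $P(f \otimes g) \in C_{\mathrm{pol}}(\mathbb{R})$.

Proof. Let $G_0, G_1$ be two independent standard Gaussian variables. Let $G = (G_0, G_1)^T$, let $M(x) = (\alpha_0 x + \beta_0, \alpha_1 x + \beta_1)^T$ and let $A$ be a $2 \times 2$ matrix such that $A A^T = \sigma^2 \begin{pmatrix} 1 & \rho \\ \rho & 1 \end{pmatrix}$. Then $G(x) = M(x) + AG$ has distribution $P(x, \cdot)$. Hence, $P(|f \otimes g|)(x) = \mathbb{E}[|f \otimes g(G(x))|]$. Now, $\| \cdot \|$ denoting the Euclidean norm in $\mathbb{R}^2$, we can find $c \ge 0$ and $m \in \mathbb{N}$ such that $|f \otimes g(G(x))| \le c(1 + \|G(x)\|^m)$. Since $\|G(x)\| \le c(1 + |x| + \|G\|)$ for a constant $c$ and $\mathbb{E}[\|G\|^m] < \infty$, we may eventually find a $c \ge 0$ such that, for all $x \in \mathbb{R}$,

\[
P(|f \otimes g|)(x) \le c(1 + |x|^m),
\]

which completes the proof. □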

Lemma 26. $C_{\mathrm{pol}}(\mathbb{R})$ fulfills conditions (iv)-(v); $\mu$ is the stationary distribution of $Y$.

Proof. In the BAR model (1), the induced MC has the stochastic dynamics

\[
Y_{r+1} = a_{r+1} Y_r + b_{r+1}, \tag{11}
\]

where $((a_r, b_r), r \in \mathbb{N}^*)$ is a sequence of i.i.d. r.v., independent of $Y_0$. Precisely, $a_{r+1} = \alpha_{\zeta_{r+1}}$, $b_{r+1} = \beta_{\zeta_{r+1}} + \varepsilon'_{r+1}$, where $(\varepsilon'_q, q \in \mathbb{N}^*)$ and $(\zeta_q, q \in \mathbb{N}^*)$ are independent sequences of i.i.d. r.v., independent of $Y_0$, each $\varepsilon'_q$ has law $\mathcal{N}(0, \sigma^2)$ and each $\zeta_q$ is a balanced Bernoulli r.v., that is, $P(\zeta_q = 0) = P(\zeta_q = 1) = 1/2$. Bougerol and Picard [7] call the sequence $(Y_r, r \in \mathbb{N})$ a generalized autoregressive process. It is often simply called AR(1), for AutoRegressive of order 1, in the literature. We have

\[
Y_r = a_r a_{r-1} \cdots a_2 a_1 Y_0 + \sum_{k=1}^{r} a_r a_{r-1} \cdots a_{k+1} b_k.
\]

Since the r.v. $((a_r, b_r), r \in \mathbb{N}^*)$ are i.i.d., $Y_r$ has the same distribution as

\[
Z_r = a_1 a_2 \cdots a_{r-1} a_r Y_0 + \sum_{k=1}^{r} a_1 a_2 \cdots a_{k-1} b_k. \tag{12}
\]

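Let us first prove (v). Let $f \in C_{\mathrm{pol}}(\mathbb{R})$ and $x \in \mathbb{R}$, and let us denote $S = \sum_{k=1}^{\infty} |a_1 a_2 \cdots a_{k-1} b_k|$. From (12), $|Z_r| \le |Y_0| + S$ for all $r \in \mathbb{N}$, so that we can find $c \ge 0$ and $m \in \mathbb{N}^*$ such that $\mathbb{E}_x[|f(Z_r)|] \le c(1 + |x|^m + \mathbb{E}[S^m])$. Now, let us denote $\alpha = \max\{|\alpha_0|, |\alpha_1|\} < 1$. Using the triangle inequality in the first line and the fact that the $L^m$-norm of $b_k$, $\|b_k\|_{L^m} = c_m$, does not depend on $k$ in the last one,

\[
\begin{aligned}
\|S\|_{L^m} &\le \sum_{k=1}^{\infty} \|a_1 a_2 \cdots a_{k-1} b_k\|_{L^m} \\
&\le \sum_{k=1}^{\infty} \alpha^{k-1} \|b_k\|_{L^m} = c_m \sum_{k=1}^{\infty} \alpha^{k-1} < \infty.
\end{aligned} \tag{13}
\]

Eventually, $|Q^r f(x)| \le \mathbb{E}_x[|f(Y_r)|] = \mathbb{E}_x[|f(Z_r)|] \le c'(1 + |x|^m)$ for some $c'$ which does not depend on $r$, which proves (v).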

Let us now prove (iv). Since $|a_1 a_2 \cdots a_{r-1} a_r Y_0| \le \alpha^r |Y_0|$, we have that a.s. $\lim_{r \to \infty} a_1 a_2 \cdots a_{r-1} a_r Y_0 = 0$. Besides, the sum in (12) a.s. converges as $r$ grows to infinity as $\mathbb{E}[|S|^m] < \infty$. Eventually, the sequence $(Z_r, r \in \mathbb{N})$ almost surely converges to

\[
Z_\infty = \sum_{k=1}^{\infty} a_1 a_2 \cdots a_{k-1} b_k. \tag{14}
\]

Let $\mu$ denote the distribution of $Z_\infty$. Then $C_{\mathrm{pol}}(\mathbb{R}) \subset L^1(\mu)$. Indeed, $\|Z_\infty\|_{L^m} \le \|S\|_{L^m} < \infty$ for all $m \in \mathbb{N}$. Let us eventually prove that $\lim_{r \to \infty} \mathbb{E}_x[f(Y_r)] = (\mu, f)$. Since $|f(Z_r)| \le c(1 + (|Y_0| + S)^m) \in L^1(P_x)$ and $(Z_r, r \in \mathbb{N})$ almost surely converges to $Z_\infty$, we can apply Lebesgue's dominated convergence theorem and get that

\[
\lim_{r \to \infty} \mathbb{E}_x[f(Y_r)] = \lim_{r \to \infty} \mathbb{E}_x[f(Z_r)] = \mathbb{E}_x[f(Z_\infty)] = (\mu, f).
\]

Condition (iv) is now fully checked, and $\mu$ is the unique stationary distribution of $Y$. □
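
The forgetting of the initial condition used in this proof can be observed numerically. The sketch below iterates the recursion (11) from a deliberately remote starting point and averages the final state over many independent runs; the parameter values, the seed and the function name are illustrative assumptions, not quantities taken from the data.

```python
import random


def simulate_induced_chain(alpha, beta, sigma, y0, steps, rng):
    """Iterate Y_{r+1} = a_{r+1} Y_r + b_{r+1} as in (11) and return the final state."""
    y = y0
    for _ in range(steps):
        zeta = rng.randrange(2)  # balanced Bernoulli choice of the daughter type
        y = alpha[zeta] * y + beta[zeta] + rng.gauss(0.0, sigma)
    return y


rng = random.Random(0)
alpha, beta, sigma = (0.5, 0.4), (1.0, 1.2), 0.3  # illustrative values only
samples = [
    simulate_induced_chain(alpha, beta, sigma, y0=10.0, steps=60, rng=rng)
    for _ in range(20000)
]
empirical_mean = sum(samples) / len(samples)
```

With these values $\max\{|\alpha_0|, |\alpha_1|\} = 0.5$, so the contribution of $Y_0$ after 60 steps is at most $10 \times 0.5^{60}$ and the empirical mean only reflects the limiting law $\mu$.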

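Let us denote by $C_{\mathrm{pol}}(\mathbb{R}^3)$ the set of all continuous and polynomially growing functions $f : \mathbb{R}^3 \to \mathbb{R}$. Since the product of two functions of $C_{\mathrm{pol}}(\mathbb{R}^3)$ belongs to $C_{\mathrm{pol}}(\mathbb{R}^3)$ and $P(C_{\mathrm{pol}}(\mathbb{R}^3)) \subset C_{\mathrm{pol}}(\mathbb{R})$, Theorems 11 and 12 and Corollary 24 imply:

Proposition 27. In the BAR model (1), assume that the distribution of the ancestor $X_1$, $\nu$, has finite moments of all orders. Let $\mu$ be the unique stationary distribution of the induced MC $(Y_r, r \in \mathbb{N})$. Then:

(i) for all $f \in C_{\mathrm{pol}}(\mathbb{R})$, $\overline{M}_{\mathbb{G}_q}(f)$, $\overline{M}_{\mathbb{T}_r}(f)$ and $\overline{M}^{\Pi}_n(f)$ converge to $(\mu, f)$ in $L^2$,

(ii) for all $f \in C_{\mathrm{pol}}(\mathbb{R}^3)$, $\overline{M}_{\mathbb{G}_q}(f)$, $\overline{M}_{\mathbb{T}_r}(f)$ and $\overline{M}^{\Pi}_n(f)$ converge to $(\mu, Pf)$ in $L^2$,

(iii) for all $f_1, \ldots, f_d \in C_{\mathrm{pol}}(\mathbb{R}^3)$, $n^{-1/2}(M^{\Pi}_n(f) - M^{\Pi}_n(Pf))$ converges in distribution to $\mathcal{N}_d(0, \Sigma)$, where $f = (f_1, \ldots, f_d)$ and $\Sigma_{ij} = (\mu, P(f_i f_j)) - (\mu, Pf_i\, Pf_j)$.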

3.1.2. Strong law of large numbers. We can also derive almost sure con- 
vergence results: 

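Proposition 28. With the assumptions of Proposition 27:

(i) almost surely, for any $f \in C_{\mathrm{pol}}(\mathbb{R})$, $\overline{M}_{\mathbb{G}_q}(f)$ and $\overline{M}_{\mathbb{T}_r}(f)$ converge to $(\mu, f)$,

(ii) almost surely, for any $f \in C_{\mathrm{pol}}(\mathbb{R}^3)$, $\overline{M}_{\mathbb{G}_q}(f)$ and $\overline{M}_{\mathbb{T}_r}(f)$ converge to $(\mu, Pf)$.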

Proof. Let us take $F$ to be the set $C^1_{\mathrm{pol}}(\mathbb{R})$ of all $C^1$ functions $f : \mathbb{R} \to \mathbb{R}$ such that $|f| + |f'|$ is bounded above by a polynomial. One can easily check that $C^1_{\mathrm{pol}}(\mathbb{R})$ satisfies (i)-(v).

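Step 1. Let us first prove that

\[
\forall f \in C^1_{\mathrm{pol}}(\mathbb{R}) \qquad P\Big( \lim_{q \to \infty} \overline{M}_{\mathbb{G}_q}(f) = \lim_{r \to \infty} \overline{M}_{\mathbb{T}_r}(f) = (\mu, f) \Big) = 1. \tag{15}
\]

Let $f \in C^1_{\mathrm{pol}}(\mathbb{R})$. We want to apply Corollary 15 with $F = C^1_{\mathrm{pol}}(\mathbb{R})$ and to the function $g = f - (\mu, f) \in C^1_{\mathrm{pol}}(\mathbb{R})$. First note that $Q^r g(x) = Q^r f(x) - (\mu, f) = \mathbb{E}_x[f(Z_r) - f(Z_\infty)]$, so that using Cauchy-Schwarz's inequality,

\[
|Q^r g(x)| \le \mathbb{E}_x[W_r |Z_r - Z_\infty|] \le \big( \mathbb{E}_x[W_r^2]\, \mathbb{E}_x[(Z_r - Z_\infty)^2] \big)^{1/2},
\]

where $W_r = \sup_{z \in [Z_r, Z_\infty]} |f'(z)|$. We can find $c_1 \ge 0$ and $m \in \mathbb{N}$ such that for all $z \in \mathbb{R}$, $|f'(z)|^2 \le c_1(1 + |z|^m)$ so that, using (12) and (13), there is a $c'_1 \ge 0$ such that, for all $x \in \mathbb{R}$ and $r \in \mathbb{N}$,

\[
\mathbb{E}_x[W_r^2] \le \mathbb{E}_x\Big[ \sup_{z \in [Z_r, Z_\infty]} c_1(1 + |z|^m) \Big] \le c_1\big(1 + \mathbb{E}_x[|Z_r|^m] + \mathbb{E}_x[|Z_\infty|^m]\big) \le c'_1(1 + |x|^m).
\]

Moreover, $Z_r - Z_\infty = a_1 a_2 \cdots a_{r-1} a_r Y_0 - \sum_{k=r+1}^{\infty} a_1 a_2 \cdots a_{k-1} b_k$ so that $\|Z_r - Z_\infty\|_{L^2(P_x)} \le \alpha^r |x| + \| \sum_{k=r+1}^{\infty} a_1 a_2 \cdots a_{k-1} b_k \|_{L^2}$ ($\alpha = \max\{|\alpha_0|, |\alpha_1|\}$). Now we have $\| \sum_{k=r+1}^{\infty} a_1 a_2 \cdots a_{k-1} b_k \|_{L^2} \le c_2 \sum_{k=r+1}^{\infty} \alpha^{k-1} = c_2 \alpha^r/(1 - \alpha)$, where $c_2 = \|b_k\|_{L^2}$ does not depend on $k$. Thus, we can find $c_3 \ge 0$ such that for all $x \in \mathbb{R}$ and $r \in \mathbb{N}$,

\[
\mathbb{E}_x[(Z_r - Z_\infty)^2] \le c_3\, \alpha^{2r} (1 + x^2).
\]

Eventually,

\[
|Q^r g(x)| \le \big( c'_1 c_3\, \alpha^{2r} (1 + |x|^m)(1 + x^2) \big)^{1/2} \le c(x)\, \kappa_r, \tag{16}
\]

with $\kappa_r = \alpha^r$ and a function $c \in C^1_{\mathrm{pol}}(\mathbb{R})$. Corollary 15 implies that $\overline{M}_{\mathbb{G}_q}(g)$ and $\overline{M}_{\mathbb{T}_r}(g)$ almost surely converge to 0, that is, $\overline{M}_{\mathbb{G}_q}(f)$ and $\overline{M}_{\mathbb{T}_r}(f)$ almost surely converge to $(\mu, f)$, which proves (15).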

Step 2. Let us now prove that, almost surely, the empirical distributions $\overline{M}_{\mathbb{G}_q}$ and $\overline{M}_{\mathbb{T}_r}$ weakly converge to $\mu$. There exists a sequence $(f_p, p \in \mathbb{N})$ of $C^\infty$ functions with compact support which characterizes convergence in distribution. Hence, it is enough to show that, almost surely, for all $p \in \mathbb{N}$, $\lim_{q \to \infty} \overline{M}_{\mathbb{G}_q}(f_p) = \lim_{r \to \infty} \overline{M}_{\mathbb{T}_r}(f_p) = (\mu, f_p)$. But this immediately follows from Step 1, since $f_p \in C^1_{\mathrm{pol}}(\mathbb{R})$.

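Step 3. Let us now prove assertion (i). Let us deal with $\overline{M}_{\mathbb{G}_q}$ (the proof for $\overline{M}_{\mathbb{T}_r}$ is similar). For $k, l \in \mathbb{N}$, let us write $f_{k,l}(x) = k(1 + x^{2l})$. Since $f_{k,l} \in C^1_{\mathrm{pol}}(\mathbb{R})$, from Step 1, almost surely,

\[
\forall k, l \in \mathbb{N} \qquad \lim_{q \to \infty} \overline{M}_{\mathbb{G}_q}(f_{k,l}) = (\mu, f_{k,l}). \tag{17}
\]

From Step 2, the empirical distributions $\overline{M}_{\mathbb{G}_q}$ a.s. weakly converge to $\mu$. Besides, for all $f \in C_{\mathrm{pol}}(\mathbb{R})$, there exist $k, l \in \mathbb{N}$ such that $f^2 \le f_{k,l}$. Thus, from (17), a.s. for all $f \in C_{\mathrm{pol}}(\mathbb{R})$, the sequence $(\overline{M}_{\mathbb{G}_q}(f^2), q \in \mathbb{N})$ is bounded, which proves that a.s. every $f \in C_{\mathrm{pol}}(\mathbb{R})$ is $(\overline{M}_{\mathbb{G}_q}, q \in \mathbb{N})$-uniformly integrable. Hence, a.s., for all $f \in C_{\mathrm{pol}}(\mathbb{R})$, $\overline{M}_{\mathbb{G}_q}(f)$ converges to $(\mu, f)$.

Step 4. The proof of (ii) is similar to the proof of (i). □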

Remark 29. A natural choice for $\nu$ is the stationary distribution $\mu$. Indeed, the ancestor $X_1$ is picked from a metacolony that has evolved for a long time, so that in the BAR model (1) its distribution should be close to $\mu$. With this particular choice, we can apply Propositions 27 and 28. Indeed, $\mu$ has finite moments of all orders, since $C_{\mathrm{pol}}(\mathbb{R}) \subset L^1(\mu)$.

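3.2. Estimation of the parameters. We seek to estimate the 4-dimensional parameter $\theta = (\alpha_0, \beta_0, \alpha_1, \beta_1)$, as well as $\sigma^2$ and $\rho$. Assume we observe a complete subtree $\mathbb{T}_{r+1}$. Then, since the couples $(\varepsilon_{2i}, \varepsilon_{2i+1})$ are i.i.d. bivariate Gaussian vectors, the maximum likelihood estimator $\hat\theta^r = (\hat\alpha_0^r, \hat\beta_0^r, \hat\alpha_1^r, \hat\beta_1^r)$ of $\theta$ is also the least squares one: for $\epsilon \in \{0, 1\}$,

\[
\begin{aligned}
\hat\alpha_\epsilon^r &= \frac{|\mathbb{T}_r|^{-1} \sum_{i \in \mathbb{T}_r} X_i X_{2i+\epsilon} - \big(|\mathbb{T}_r|^{-1} \sum_{i \in \mathbb{T}_r} X_i\big)\big(|\mathbb{T}_r|^{-1} \sum_{i \in \mathbb{T}_r} X_{2i+\epsilon}\big)}{|\mathbb{T}_r|^{-1} \sum_{i \in \mathbb{T}_r} X_i^2 - \big(|\mathbb{T}_r|^{-1} \sum_{i \in \mathbb{T}_r} X_i\big)^2}, \\
\hat\beta_\epsilon^r &= |\mathbb{T}_r|^{-1} \sum_{i \in \mathbb{T}_r} X_{2i+\epsilon} - \hat\alpha_\epsilon^r\, |\mathbb{T}_r|^{-1} \sum_{i \in \mathbb{T}_r} X_i.
\end{aligned}
\]

Hence, $\hat\alpha_0^r$ (resp. $\hat\alpha_1^r$) is the empirical correlation between new (resp. old) pole daughters and their mothers. We shall denote by $xy$ (resp. $x$, $y$, $x^2$) the element of $C_{\mathrm{pol}}(\mathbb{R}^3)$ defined by $(x, y, z) \mapsto xy$ (resp. $x$, $y$, $x^2$).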
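
To make the estimator concrete, here is a minimal sketch that simulates a complete tree from the BAR model with Gaussian noise and then applies the least squares formulas over the mothers $i \in \mathbb{T}_r$. The parameter values, the seed, the tree depth and the function names are illustrative assumptions, not those of the data set.

```python
import math
import random


def simulate_bar_tree(theta, sigma, rho, r, x1, rng):
    """Simulate X_1, ..., X_{|T_{r+1}|}; index 0 is unused so that cell i has daughters 2i, 2i+1."""
    a0, b0, a1, b1 = theta
    n_mothers = 2 ** (r + 1) - 1  # |T_r|
    x = [0.0] * (2 * n_mothers + 2)
    x[1] = x1
    for i in range(1, n_mothers + 1):
        g0, g1 = rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)
        e0 = sigma * g0  # noise of daughter 2i
        e1 = sigma * (rho * g0 + math.sqrt(1.0 - rho ** 2) * g1)  # correlated noise of 2i+1
        x[2 * i] = a0 * x[i] + b0 + e0
        x[2 * i + 1] = a1 * x[i] + b1 + e1
    return x, n_mothers


def least_squares(x, n_mothers):
    """Return (alpha0, beta0, alpha1, beta1) estimated over the mothers i in T_r."""
    mothers = x[1:n_mothers + 1]
    mean_x = sum(mothers) / n_mothers
    var_x = sum(v * v for v in mothers) / n_mothers - mean_x ** 2
    estimates = []
    for eps in (0, 1):
        daughters = [x[2 * i + eps] for i in range(1, n_mothers + 1)]
        mean_d = sum(daughters) / n_mothers
        cov = sum(m * d for m, d in zip(mothers, daughters)) / n_mothers - mean_x * mean_d
        slope = cov / var_x
        estimates += [slope, mean_d - slope * mean_x]
    return tuple(estimates)


rng = random.Random(1)
theta = (0.5, 1.0, 0.4, 1.2)  # illustrative (alpha0, beta0, alpha1, beta1)
x, n_mothers = simulate_bar_tree(theta, sigma=0.3, rho=0.5, r=13, x1=2.0, rng=rng)
theta_hat = least_squares(x, n_mothers)
```

The correlated pair of noises is built from two independent standard Gaussian draws, which is one of several equivalent ways to obtain the covariance structure of the model.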

Remark 30. Note that $(\mu, x^2) - (\mu, x)^2 > 0$. Indeed, it is nonnegative, and if it were 0, $\mu$ would be a Dirac mass. Now a Dirac mass cannot be stationary for $Y$ because $\sigma^2 > 0$.

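Proposition 31. In the BAR model (1), assume that the distribution of the ancestor $X_1$, $\nu$, has finite moments of all orders. Then $(\hat\theta^r, r \in \mathbb{N})$ is a strongly consistent estimator of $\theta$.

Proof. Let us treat $\hat\alpha_0^r$. Convergence of $\hat\beta_0^r$, $\hat\alpha_1^r$ and $\hat\beta_1^r$ may be treated in a similar way. Note that $\hat\alpha_0^r = C_r/B_r$ with

\[
C_r = \overline{M}_{\mathbb{T}_r}(xy) - \overline{M}_{\mathbb{T}_r}(x)\, \overline{M}_{\mathbb{T}_r}(y) \quad \text{and} \quad B_r = \overline{M}_{\mathbb{T}_r}(x^2) - \overline{M}_{\mathbb{T}_r}(x)^2.
\]

Since $P(xy)(x) = x(\alpha_0 x + \beta_0)$ and $P(y)(x) = \alpha_0 x + \beta_0$, Proposition 28 implies that $C_r$ a.s. converges to $(\mu, x(\alpha_0 x + \beta_0)) - (\mu, x)(\mu, \alpha_0 x + \beta_0) = \alpha_0((\mu, x^2) - (\mu, x)^2)$ and $B_r$ a.s. converges to $(\mu, x^2) - (\mu, x)^2$, which from Remark 30 is positive, so that $\hat\alpha_0^r$ a.s. converges to $\alpha_0$. □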

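Remark 32. Let us denote $\bar\alpha = (\alpha_0 + \alpha_1)/2$, $\bar\beta = (\beta_0 + \beta_1)/2$ and so on (for instance, $\overline{\alpha\beta} = (\alpha_0 \beta_0 + \alpha_1 \beta_1)/2$). Then

\[
(\mu, x) = \frac{\bar\beta}{1 - \bar\alpha} \quad \text{and} \quad (\mu, x^2) = \frac{2\, \overline{\alpha\beta}\, \bar\beta/(1 - \bar\alpha) + \overline{\beta^2} + \sigma^2}{1 - \overline{\alpha^2}}. \tag{18}
\]

Indeed, recalling (11) and (14), $Z_\infty$ has the same law as $a_1 Z_\infty + b_1$, where the pair $(a_1, b_1) = (\alpha_{\zeta_1}, \beta_{\zeta_1} + \varepsilon'_1)$ is independent of $Z_\infty$ and $(\alpha_{\zeta_1}, \beta_{\zeta_1})$ takes the values $(\alpha_0, \beta_0)$ and $(\alpha_1, \beta_1)$ with probability 1/2. Hence, $(\mu, x) = \mathbb{E}[Z_\infty] = \mathbb{E}[a_1 Z_\infty + b_1] = \bar\alpha (\mu, x) + \bar\beta$, as announced. Likewise,

\[
\begin{aligned}
(\mu, x^2) = \mathbb{E}[Z_\infty^2] &= \mathbb{E}[(a_1 Z_\infty + b_1)^2] = \mathbb{E}[a_1^2 Z_\infty^2] + 2\, \mathbb{E}[a_1 b_1 Z_\infty] + \mathbb{E}[b_1^2] \\
&= \overline{\alpha^2} (\mu, x^2) + 2\, \overline{\alpha\beta}\, (\mu, x) + \overline{\beta^2} + \sigma^2,
\end{aligned}
\]

from which we deduce the second equality in (18).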
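
The two moments can be evaluated directly from the parameters. The short sketch below implements the formulas of (18) and checks them against the two fixed-point equations derived above; the function name and the numerical values are illustrative assumptions.

```python
def stationary_moments(theta, sigma):
    """First and second moments of the stationary law mu, following (18)."""
    a0, b0, a1, b1 = theta
    a_bar = (a0 + a1) / 2
    b_bar = (b0 + b1) / 2
    a2_bar = (a0 ** 2 + a1 ** 2) / 2
    ab_bar = (a0 * b0 + a1 * b1) / 2
    b2_bar = (b0 ** 2 + b1 ** 2) / 2
    mu1 = b_bar / (1 - a_bar)
    mu2 = (2 * ab_bar * mu1 + b2_bar + sigma ** 2) / (1 - a2_bar)
    return mu1, mu2


theta, sigma = (0.5, 1.0, 0.4, 1.2), 0.3  # illustrative values only
mu1, mu2 = stationary_moments(theta, sigma)

# Residuals of the two fixed-point equations; both should vanish up to rounding.
a0, b0, a1, b1 = theta
residual_1 = (a0 + a1) / 2 * mu1 + (b0 + b1) / 2 - mu1
residual_2 = ((a0 ** 2 + a1 ** 2) / 2 * mu2 + (a0 * b0 + a1 * b1) * mu1
              + (b0 ** 2 + b1 ** 2) / 2 + sigma ** 2 - mu2)
```

The difference `mu2 - mu1 ** 2` is the variance of $\mu$, which Remark 30 shows to be positive.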

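From the preceding remark, we define two continuous functions $\mu_1 : \Theta \to \mathbb{R}$ and $\mu_2 : \Theta \times \mathbb{R}_+^* \to \mathbb{R}$ by writing

\[
(\mu, x) = \mu_1(\theta) \quad \text{and} \quad (\mu, x^2) = \mu_2(\theta, \sigma^2), \tag{19}
\]

where $\theta = (\alpha_0, \beta_0, \alpha_1, \beta_1) \in \Theta = (-1, 1) \times \mathbb{R} \times (-1, 1) \times \mathbb{R}$. Let us now build a confidence region for $\theta$.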

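Proposition 33. In the BAR model (1), assume that the distribution $\nu$ of the ancestor $X_1$ has finite moments of all orders. Let $\mu$ be the unique stationary distribution of the induced MC $(Y_r, r \in \mathbb{N})$. Then $|\mathbb{T}_r|^{1/2}(\hat\theta^r - \theta)$ converges in law to $\mathcal{N}_4(0, \Sigma')$, where

\[
\Sigma' = \sigma^2 \begin{pmatrix} K & \rho K \\ \rho K & K \end{pmatrix}
\quad \text{with} \quad
K = \frac{1}{\mu_2(\theta, \sigma^2) - \mu_1(\theta)^2} \begin{pmatrix} 1 & -\mu_1(\theta) \\ -\mu_1(\theta) & \mu_2(\theta, \sigma^2) \end{pmatrix}. \tag{20}
\]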

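Proof. For $f_1, \ldots, f_d \in C_{\mathrm{pol}}(\mathbb{R}^3)$, we denote $f = (f_1, \ldots, f_d)$ and $U_r(f) = |\mathbb{T}_r|^{1/2}(\overline{M}_{\mathbb{T}_r}(f) - \overline{M}_{\mathbb{T}_r}(Pf))$. Let us denote $\zeta_r = |\mathbb{T}_r|^{1/2}(\hat\theta^r - \theta)$. We first observe that $\zeta_r = \varphi(U_r(f), A_r, B_r)$ with $f = (xy, y, xz, z)$, $\varphi(u, a, b) = M(a, b) u$,

\[
M(a, b) = \begin{pmatrix}
1/b & -a/b & 0 & 0 \\
-a/b & (b + a^2)/b & 0 & 0 \\
0 & 0 & 1/b & -a/b \\
0 & 0 & -a/b & (b + a^2)/b
\end{pmatrix},
\]

$A_r = \overline{M}_{\mathbb{T}_r}(x)$ and $B_r = \overline{M}_{\mathbb{T}_r}(x^2) - \overline{M}_{\mathbb{T}_r}(x)^2$. From Proposition 27(iii), $U_r(f)$ converges in distribution to $G \sim \mathcal{N}_4(0, \Sigma)$ with

\[
\Sigma = \sigma^2 \begin{pmatrix}
\mu_2(\theta, \sigma^2) & \mu_1(\theta) & \rho \mu_2(\theta, \sigma^2) & \rho \mu_1(\theta) \\
\mu_1(\theta) & 1 & \rho \mu_1(\theta) & \rho \\
\rho \mu_2(\theta, \sigma^2) & \rho \mu_1(\theta) & \mu_2(\theta, \sigma^2) & \mu_1(\theta) \\
\rho \mu_1(\theta) & \rho & \mu_1(\theta) & 1
\end{pmatrix}.
\]

Besides, Proposition 27(i) implies that $(A_r, B_r)$ converges in law to the constant $(a, b) = (\mu_1(\theta), \mu_2(\theta, \sigma^2) - \mu_1(\theta)^2)$. Thus, Slutsky's theorem states that $(U_r(f), A_r, B_r)$ converges in law to $(G, a, b)$. Then, by continuity of $\varphi$ on $\mathbb{R}^4 \times \mathbb{R} \times \mathbb{R}_+^*$ and recalling from Remark 30 that $b > 0$, $\zeta_r = \varphi(U_r(f), A_r, B_r)$ converges in law to $\varphi(G, a, b) = M(a, b) G$, which is a centered Gaussian vector with covariance matrix $\Sigma' = M(a, b)\, \Sigma\, M(a, b)^T$. Now we have

\[
\Sigma = \sigma^2 \begin{pmatrix} L & \rho L \\ \rho L & L \end{pmatrix}
\quad \text{with} \quad
L = \begin{pmatrix} \mu_2(\theta, \sigma^2) & \mu_1(\theta) \\ \mu_1(\theta) & 1 \end{pmatrix},
\qquad
M(a, b) = \begin{pmatrix} K & 0 \\ 0 & K \end{pmatrix}.
\]

Since $LK = I_2$, the $2 \times 2$ identity matrix, we get

\[
\Sigma' = \sigma^2 \begin{pmatrix} K & 0 \\ 0 & K \end{pmatrix} \begin{pmatrix} L & \rho L \\ \rho L & L \end{pmatrix} \begin{pmatrix} K & 0 \\ 0 & K \end{pmatrix} = \sigma^2 \begin{pmatrix} K & \rho K \\ \rho K & K \end{pmatrix},
\]

which completes the proof. □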
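
The block computation at the end of this proof can be sanity-checked numerically. The sketch below builds $M(a, b)$ and $\Sigma$ from the blocks $K$ and $L$ and compares $M \Sigma M^T$ with $\sigma^2 \begin{pmatrix} K & \rho K \\ \rho K & K \end{pmatrix}$; the helper names and the numerical values (chosen so that $\mu_2 > \mu_1^2$) are illustrative assumptions.

```python
def matmul(a, b):
    """Product of two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]


def transpose(a):
    return [list(row) for row in zip(*a)]


def scale(c, m):
    return [[c * v for v in row] for row in m]


def block(m11, m12, m21, m22):
    """Assemble a 4x4 matrix from four 2x2 blocks."""
    top = [r1 + r2 for r1, r2 in zip(m11, m12)]
    bottom = [r1 + r2 for r1, r2 in zip(m21, m22)]
    return top + bottom


mu1, mu2, sigma2, rho = 2.0, 4.5, 0.09, 0.5  # illustrative values with mu2 > mu1**2
b = mu2 - mu1 ** 2
L = [[mu2, mu1], [mu1, 1.0]]
K = [[1 / b, -mu1 / b], [-mu1 / b, mu2 / b]]  # inverse of L
zero = [[0.0, 0.0], [0.0, 0.0]]

M = block(K, zero, zero, K)
Sigma = scale(sigma2, block(L, scale(rho, L), scale(rho, L), L))
Sigma_prime = matmul(matmul(M, Sigma), transpose(M))
expected = scale(sigma2, block(K, scale(rho, K), scale(rho, K), K))
max_gap = max(abs(Sigma_prime[i][j] - expected[i][j]) for i in range(4) for j in range(4))
```

Note that the lower-right entry of $K$ is written here as $\mu_2/b$, which equals $(b + a^2)/b$ with $a = \mu_1$ and $b = \mu_2 - \mu_1^2$.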



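We also need to estimate the conditional variance, $\sigma^2$, and the conditional sister-sister correlation, $\rho$. Since $\sigma^2$ is the common expectation of the i.i.d. r.v. $(\varepsilon_l^2, l \ge 2)$, it is naturally estimated, given a complete observation $(X_i, i \in \mathbb{T}_{r+1})$, by

\[
\hat\sigma_r^2 = \frac{1}{2 |\mathbb{T}_r|} \sum_{i \in \mathbb{T}_r} (\hat\varepsilon_{2i}^2 + \hat\varepsilon_{2i+1}^2),
\]

where

\[
\hat\varepsilon_{2n} = X_{2n} - \hat\alpha_0^r X_n - \hat\beta_0^r, \qquad \hat\varepsilon_{2n+1} = X_{2n+1} - \hat\alpha_1^r X_n - \hat\beta_1^r,
\]

are the residues. Likewise, since $\rho = \mathrm{Cov}(\varepsilon_{2i}, \varepsilon_{2i+1})/\sigma^2$, it is naturally estimated by

\[
\hat\rho_r = \frac{1}{\hat\sigma_r^2 |\mathbb{T}_r|} \sum_{i \in \mathbb{T}_r} \hat\varepsilon_{2i}\, \hat\varepsilon_{2i+1}.
\]

We have checked that $(\hat\sigma_r^2, \hat\rho_r)$ is the maximum likelihood estimator of $(\sigma^2, \rho)$.

Proposition 34. In the BAR model (1), assume that the distribution of the ancestor $X_1$, $\nu$, has finite moments of all orders. Then $((\hat\sigma_r^2, \hat\rho_r), r \in \mathbb{N})$ is a strongly consistent estimator of $(\sigma^2, \rho)$.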

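Proof. Let us first deal with $\hat\sigma_r^2$. Observe that

\[
\hat\sigma_r^2 = \frac{1}{2 |\mathbb{T}_r|} \sum_{i \in \mathbb{T}_r} f(\Delta_i, \hat\theta^r),
\]

where $f(\Delta, \theta) = (y - \alpha_0 x - \beta_0)^2 + (z - \alpha_1 x - \beta_1)^2$, with $\Delta = (x, y, z)$ and $\theta = (\alpha_0, \beta_0, \alpha_1, \beta_1)$. Thus, we have $\hat\sigma_r^2 = \overline{M}_{\mathbb{T}_r}(f(\cdot, \theta))/2 + D_r$, with

\[
D_r = \frac{1}{2 |\mathbb{T}_r|} \sum_{i \in \mathbb{T}_r} \big( f(\Delta_i, \hat\theta^r) - f(\Delta_i, \theta) \big).
\]

Since $f(\cdot, \theta) \in C_{\mathrm{pol}}(\mathbb{R}^3)$, we can apply Proposition 28(ii): $\overline{M}_{\mathbb{T}_r}(f(\cdot, \theta))$ a.s. converges to $(\mu, P(f(\cdot, \theta)))$. Now, $P(f(\cdot, \theta))(x) = \mathbb{E}_x[(X_2 - \alpha_0 X_1 - \beta_0)^2 + (X_3 - \alpha_1 X_1 - \beta_1)^2] = \mathbb{E}[\varepsilon_2^2 + \varepsilon_3^2] = 2\sigma^2$. Hence, $\overline{M}_{\mathbb{T}_r}(f(\cdot, \theta))$ a.s. converges to $2\sigma^2$. Thus, it is enough to prove that $D_r$ a.s. tends to 0. Let us write $\theta = (\theta_1, \theta_2, \theta_3, \theta_4) \in \Theta = (-1, 1) \times \mathbb{R} \times (-1, 1) \times \mathbb{R}$. From the Taylor-Lagrange formula, for any $\Delta \in \mathbb{R}^3$ and $\theta, \theta' \in \Theta$, we can find $\lambda \in (0, 1)$ such that

\[
f(\Delta, \theta') - f(\Delta, \theta) = \sum_{j=1}^{4} (\theta'_j - \theta_j)\, \partial_{\theta_j} f(\Delta, \theta + \lambda(\theta' - \theta)).
\]

Now, observing that $f$ is a polynomial of global degree 4 and of degree 2 in each $\theta_j$, we can find $g \in C_{\mathrm{pol}}(\mathbb{R}^3)$ such that for all $j \in \{1, 2, 3, 4\}$, $\Delta \in \mathbb{R}^3$ and $\theta \in \Theta$, $|\partial_{\theta_j} f(\Delta, \theta)| \le g(\Delta)(1 + \|\theta\|)$. Therefore, for all $r \in \mathbb{N}$,

\[
\begin{aligned}
|D_r| &\le \frac{1}{2 |\mathbb{T}_r|}\, \|\hat\theta^r - \theta\| \sum_{i \in \mathbb{T}_r} g(\Delta_i)\big(1 + \|\theta\| + \|\hat\theta^r - \theta\|\big) \\
&= \frac{1}{2}\, \|\hat\theta^r - \theta\| \big(1 + \|\theta\| + \|\hat\theta^r - \theta\|\big)\, \overline{M}_{\mathbb{T}_r}(g).
\end{aligned}
\]

From Proposition 28(ii), $\overline{M}_{\mathbb{T}_r}(g)$ a.s. converges. Besides, Proposition 31 states that $\|\hat\theta^r - \theta\|$ a.s. tends to 0. As a consequence, so does $D_r$. This completes the proof for $\hat\sigma_r^2$. The proof for $\hat\rho_r$ is very similar. □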
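
The residue-based estimators of the conditional variance and of the sister-sister correlation can be sketched as follows. The function takes a list of mother-daughters triangles $(X_i, X_{2i}, X_{2i+1})$ and an estimate of $\theta$; its name and the toy data are illustrative assumptions.

```python
def noise_estimates(triangles, theta_hat):
    """Return (sigma2_hat, rho_hat) computed from the residues of the two regressions."""
    a0, b0, a1, b1 = theta_hat
    n = len(triangles)
    residues = [(y - a0 * x - b0, z - a1 * x - b1) for x, y, z in triangles]
    sigma2_hat = sum(e0 ** 2 + e1 ** 2 for e0, e1 in residues) / (2 * n)
    rho_hat = sum(e0 * e1 for e0, e1 in residues) / (sigma2_hat * n)
    return sigma2_hat, rho_hat


# Toy triangles built so that the residues are (1, 1), (-1, -1), (1, -1), (-1, 1).
theta_hat = (0.5, 1.0, 0.5, 1.0)
toy = [(2.0, 3.0, 3.0), (4.0, 2.0, 2.0), (2.0, 3.0, 1.0), (4.0, 2.0, 4.0)]
sigma2_toy, rho_toy = noise_estimates(toy, theta_hat)
```

On this toy input the squared residues all equal 1 and the cross products cancel, so the function returns a unit variance and a zero correlation.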

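3.3. Detection of cellular aging. As explained in [13], detecting cellular aging boils down, in the BAR model (1), to rejecting hypothesis $H_0 = \{(\alpha_0, \beta_0) = (\alpha_1, \beta_1)\}$. Let us now build a statistical test that allows us to segregate between $H_0$ and its alternative $H_1 = \{(\alpha_0, \beta_0) \neq (\alpha_1, \beta_1)\}$. Wald's test is well adapted to the situation. We write $\hat\mu_{1,r} = \mu_1(\hat\theta^r)$ and $\hat\mu_{2,r} = \mu_2(\hat\theta^r, \hat\sigma_r^2)$ [recall (19)].

Proposition 35. In the BAR model (1), assume that the distribution of the ancestor $X_1$, $\nu$, has finite moments of all orders. Then the test statistic

\[
\chi_r^{(1)} = \frac{|\mathbb{T}_r|}{2 \hat\sigma_r^2 (1 - \hat\rho_r)} \Big\{ (\hat\alpha_0^r - \hat\alpha_1^r)^2\, \hat\mu_{2,r} + 2 (\hat\alpha_0^r - \hat\alpha_1^r)(\hat\beta_0^r - \hat\beta_1^r)\, \hat\mu_{1,r} + (\hat\beta_0^r - \hat\beta_1^r)^2 \Big\}
\]

converges in distribution to $\chi^2(2)$, the $\chi^2$ distribution with two degrees of freedom, under $H_0$, and almost surely diverges to $+\infty$ under $H_1$.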
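
Proof. Recall that $\theta = (\alpha_0, \beta_0, \alpha_1, \beta_1)$. Let us set $g(\theta) = (\alpha_0 - \alpha_1, \beta_0 - \beta_1)^T$. Then $H_0 = \{g(\theta) = 0\}$. From Proposition 33, $|\mathbb{T}_r|^{1/2}(\hat\theta^r - \theta)$ converges in law to $\mathcal{N}_4(0, \Sigma')$ so that $|\mathbb{T}_r|^{1/2}(g(\hat\theta^r) - g(\theta))$ converges in law to $\mathcal{N}_2(0, \Sigma'')$, with

\[
\Sigma'' = dg(\theta)\, \Sigma'\, dg(\theta)^T = \sigma^2 \begin{pmatrix} I_2 & -I_2 \end{pmatrix} \begin{pmatrix} K & \rho K \\ \rho K & K \end{pmatrix} \begin{pmatrix} I_2 \\ -I_2 \end{pmatrix} = 2 \sigma^2 (1 - \rho) K.
\]

Under $H_0$, $g(\theta) = 0$ so that $|\mathbb{T}_r|^{1/2} g(\hat\theta^r)$ converges in law to $G \sim \mathcal{N}_2(0, \Sigma'')$. Now, from (18), $K = K(\theta, \sigma)$ is a continuous function of $(\theta, \sigma) \in \Theta \times \mathbb{R}_+^*$ so that, letting $\hat K_r = K(\hat\theta^r, \hat\sigma_r)$, Propositions 31 and 34 imply that $\hat\Sigma''_r = 2 \hat\sigma_r^2 (1 - \hat\rho_r) \hat K_r$ converges in probability to $\Sigma''$. By continuity of $(\Sigma, G) \mapsto \Sigma^{-1/2} G$, Slutsky's theorem shows that $|\mathbb{T}_r|^{1/2} (\hat\Sigma''_r)^{-1/2} g(\hat\theta^r)$ converges in law to $\mathcal{N}_2(0, I_2)$. In particular,

\[
\big\| |\mathbb{T}_r|^{1/2} (\hat\Sigma''_r)^{-1/2} g(\hat\theta^r) \big\|^2 = |\mathbb{T}_r|\, g(\hat\theta^r)^T (\hat\Sigma''_r)^{-1} g(\hat\theta^r) = \frac{|\mathbb{T}_r|}{2 \hat\sigma_r^2 (1 - \hat\rho_r)}\, g(\hat\theta^r)^T \hat K_r^{-1} g(\hat\theta^r)
\]

equals $\chi_r^{(1)}$ and converges to the $\chi^2$ distribution with two degrees of freedom. Under $H_1$, $\chi_r^{(1)}/|\mathbb{T}_r| = g(\hat\theta^r)^T (\hat\Sigma''_r)^{-1} g(\hat\theta^r)$ a.s. converges to $g(\theta)^T \Sigma''^{-1} g(\theta) > 0$ so that $\chi_r^{(1)}$ a.s. diverges to $+\infty$. □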
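
As an illustration of how this Wald statistic could be evaluated in practice, the sketch below computes the quadratic form $\frac{|\mathbb{T}_r|}{2\hat\sigma_r^2(1-\hat\rho_r)}\, g^T \hat K_r^{-1} g$, using the fact that $K^{-1} = \begin{pmatrix} \mu_2 & \mu_1 \\ \mu_1 & 1 \end{pmatrix}$, together with the associated p-value. The function names and the inputs are illustrative assumptions; the p-value uses the standard fact that a $\chi^2(2)$ variable is exponential with mean 2.

```python
import math


def wald_equal_regressions(theta_hat, sigma2_hat, rho_hat, mu1_hat, mu2_hat, n_mothers):
    """Wald statistic for (alpha0, beta0) = (alpha1, beta1), given plug-in estimates."""
    a0, b0, a1, b1 = theta_hat
    da, db = a0 - a1, b0 - b1
    # g^T K^{-1} g with K^{-1} = [[mu2, mu1], [mu1, 1]]
    quad = da * da * mu2_hat + 2 * da * db * mu1_hat + db * db
    return n_mothers * quad / (2 * sigma2_hat * (1 - rho_hat))


def chi2_two_dof_pvalue(stat):
    """P(chi2(2) >= stat): the chi-square law with 2 degrees of freedom is exponential(1/2)."""
    return math.exp(-stat / 2.0)


# Hypothetical estimates, for illustration only.
stat = wald_equal_regressions((0.5, 1.0, 0.4, 1.0), sigma2_hat=0.5, rho_hat=0.0,
                              mu1_hat=2.0, mu2_hat=4.0, n_mothers=1000)
p_value = chi2_two_dof_pvalue(stat)
```

A small p-value leads to rejecting $H_0$; identical regression estimates give a statistic equal to zero and a p-value equal to one.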

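The same technique may be used to test $\{\alpha_0 = \alpha_1\}$:

Proposition 36. In the BAR model (1), assume that the distribution of the ancestor $X_1$, $\nu$, has finite moments of all orders. Then the test statistic

\[
\chi_r^{(2)} = \frac{|\mathbb{T}_r|\, (\hat\alpha_0^r - \hat\alpha_1^r)^2\, (\hat\mu_{2,r} - \hat\mu_{1,r}^2)}{2 \hat\sigma_r^2 (1 - \hat\rho_r)}
\]

converges in distribution to $\chi^2(1)$ under $\{\alpha_0 = \alpha_1\}$ and a.s. diverges to $+\infty$ under $\{\alpha_0 \neq \alpha_1\}$.

The same can be done for testing $\{\beta_0 = \beta_1\}$. Proposition 31 provides natural statistics to test $\{\alpha_0 = 0\}$, $\{\alpha_1 = 0\}$, $\{\beta_0 = 0\}$, $\{\beta_1 = 0\}$ and $\{\beta_0 = \beta_1\}$. We do not give details for the sake of brevity.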

We now present a statistical test that allows us to differentiate between $H'_0 = \{\beta_0/(1 - \alpha_0) = \beta_1/(1 - \alpha_1)\}$ and its alternative $H'_1$. This allows us to test whether the two fixed points corresponding to the two affine regressions of the BAR model (1) are equal. This may happen even if $(\alpha_0, \beta_0) \neq (\alpha_1, \beta_1)$. Rejecting $H'_0$ means accepting that the new pole and the old pole populations are even distinct in mean. Again, we use Wald's test, since $H'_0 = \{g(\theta) = 0\}$ with $g(\theta) = \beta_0/(1 - \alpha_0) - \beta_1/(1 - \alpha_1)$. The proof is obvious and not detailed here.
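
Proposition 37. In the BAR model (1), assume that the distribution of the ancestor $X_1$, $\nu$, has finite moments of all orders. Let

\[
dg(\theta) = \Big( \frac{\beta_0}{(1 - \alpha_0)^2},\ \frac{1}{1 - \alpha_0},\ -\frac{\beta_1}{(1 - \alpha_1)^2},\ -\frac{1}{1 - \alpha_1} \Big)
\]

be the gradient of $g$ and $\hat s_r^2 = dg(\hat\theta^r)\, \hat\Sigma'_r\, dg(\hat\theta^r)^T$, where $\hat\Sigma'_r$ is $\Sigma'$ evaluated in $(\hat\theta^r, \hat\sigma_r, \hat\rho_r)$, and $\Sigma'$ is defined in (20). Then the test statistic

\[
\chi_r^{(3)} = \frac{|\mathbb{T}_r|}{\hat s_r^2} \Big( \frac{\hat\beta_0^r}{1 - \hat\alpha_0^r} - \frac{\hat\beta_1^r}{1 - \hat\alpha_1^r} \Big)^2
\]

converges in distribution to $\chi^2(1)$ under $H'_0$ and a.s. diverges to $+\infty$ under $H'_1$.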
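
The statistic of Proposition 37 can be sketched in the same spirit: the gradient $dg$ and the block form (20) of $\Sigma'$ give $\hat s_r^2$ without assembling a $4 \times 4$ matrix. The function names and the numerical inputs below are illustrative assumptions; the p-value uses the standard identity $P(\chi^2(1) \ge t) = \mathrm{erfc}(\sqrt{t/2})$.

```python
import math


def wald_equal_fixed_points(theta_hat, sigma2_hat, rho_hat, mu1_hat, mu2_hat, n_mothers):
    """Wald statistic for beta0/(1-alpha0) = beta1/(1-alpha1), given plug-in estimates."""
    a0, b0, a1, b1 = theta_hat
    g0 = (b0 / (1 - a0) ** 2, 1 / (1 - a0))    # gradient w.r.t. (alpha0, beta0)
    g1 = (-b1 / (1 - a1) ** 2, -1 / (1 - a1))  # gradient w.r.t. (alpha1, beta1)
    d = mu2_hat - mu1_hat ** 2
    K = ((1 / d, -mu1_hat / d), (-mu1_hat / d, mu2_hat / d))

    def quad(u, v):
        """u^T K v for 2-vectors."""
        return sum(u[i] * K[i][j] * v[j] for i in range(2) for j in range(2))

    # dg Sigma' dg^T with Sigma' = sigma^2 [[K, rho K], [rho K, K]] as in (20)
    s2 = sigma2_hat * (quad(g0, g0) + 2 * rho_hat * quad(g0, g1) + quad(g1, g1))
    gap = b0 / (1 - a0) - b1 / (1 - a1)
    return n_mothers * gap ** 2 / s2


def chi2_one_dof_pvalue(stat):
    """P(chi2(1) >= stat) = P(|Z| >= sqrt(stat)) for a standard Gaussian Z."""
    return math.erfc(math.sqrt(stat / 2.0))


# Two different regressions sharing the same fixed point 2: the statistic vanishes.
same_fixed_point = wald_equal_fixed_points((0.5, 1.0, 0.0, 2.0), 0.09, 0.5, 2.0, 4.5, 1000)
```

This illustrates the remark above that the two fixed points may coincide even when $(\alpha_0, \beta_0) \neq (\alpha_1, \beta_1)$.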

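In the case when $\alpha_0 = \alpha_1 = 0$, testing $H_0$ or $H'_0$ boils down to testing $\{\beta_0 = \beta_1\}$.

Proposition 38. In the BAR model (1), assume that $\alpha_0 = \alpha_1 = 0$ and that the distribution of the ancestor $X_1$, $\nu$, has finite moments of all orders. Then the test statistic

\[
\xi_r = \frac{|\mathbb{T}_r|^{-1/2}}{\hat\sigma_r \sqrt{2(1 - \hat\rho_r)}} \sum_{i \in \mathbb{T}_r} (X_{2i} - X_{2i+1})
\]

converges in distribution to $\mathcal{N}(0, 1)$ under $\{\beta_0 = \beta_1\}$ and almost surely tends to $+\infty$ (resp. $-\infty$) under $\{\beta_0 > \beta_1\}$ (resp. $\{\beta_0 < \beta_1\}$).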

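Proof. Let $f(x, y, z) = y - z$. Observe that $f \in C_{\mathrm{pol}}(\mathbb{R}^3)$ and that $Pf(x) = \beta_0 - \beta_1$, since $\alpha_0 = \alpha_1 = 0$.

Let us assume that $\beta_0 = \beta_1$. Then $Pf = 0$ and Proposition 27(iii) implies that $\hat\sigma_r \sqrt{2(1 - \hat\rho_r)}\, \xi_r$ converges in distribution to $\mathcal{N}(0, s^2)$, where $s^2 = (\mu, Pf^2) = 2\sigma^2(1 - \rho)$. Now, $(\hat\sigma_r, \hat\rho_r)$ a.s. converges to the constant $(\sigma, \rho)$ so that, with Slutsky's theorem, $\xi_r$ converges in distribution to $\mathcal{N}(0, 1)$.

Let us now assume that $\beta_0 \neq \beta_1$. Proposition 28 states that the sequence $|\mathbb{T}_r|^{-1/2}\, \hat\sigma_r \sqrt{2(1 - \hat\rho_r)}\, \xi_r = |\mathbb{T}_r|^{-1} \sum_{i \in \mathbb{T}_r} f(\Delta_i)$ a.s. converges to $(\mu, Pf) = \beta_0 - \beta_1 \neq 0$. Since $\hat\sigma_r \sqrt{2(1 - \hat\rho_r)}$ converges in probability to $\sigma \sqrt{2(1 - \rho)} > 0$, we conclude that $\xi_r$ a.s. diverges as $|\mathbb{T}_r|^{1/2}$, to $+\infty$ if $\beta_0 > \beta_1$, and to $-\infty$ if $\beta_0 < \beta_1$. □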

Remark 39. Note that in the model where $\alpha_0 = \alpha_1 = 0$, $\chi_r^{(1)}$ would read

\[
\frac{|\mathbb{T}_r|\, (\hat\beta_0^r - \hat\beta_1^r)^2}{2 \hat\sigma_r^2 (1 - \hat\rho_r)}
\]

and is thus equal to $\xi_r^2$. The latter test looks like the ones Stewart et al. performed in [22]: it focuses on the differences $X_{2i} - X_{2i+1}$ between sisters. But it is relevant only in the case when the correlation parameters $\alpha_0$ and $\alpha_1$ are zero, that is, in a dynamics with no memory. Now, the data analysis strongly rejects this assumption, as we shall see in the next section.



Remark 40. Note that all the propositions and lemmas of Section 3 remain true if $((\varepsilon_{2i}, \varepsilon_{2i+1}), i \ge 1)$ is only assumed to be a sequence of (non-necessarily Gaussian) i.i.d. bivariate random vectors with finite moments of all orders, that is, $\mathbb{E}[|\varepsilon_{2i}|^m + |\varepsilon_{2i+1}|^m] < \infty$ for all $m \in \mathbb{N}$, with covariance matrix given by (2). In such a general case, $(\hat\theta^r, \hat\sigma_r, \hat\rho_r)$ has no reason to be the maximum likelihood estimator of $(\theta, \sigma, \rho)$.



3.4. Data numerical analysis. We now perform the estimation and test procedures on Stewart et al.'s data (see Guyon [12] for more detailed results). The data consists of 95 films, and each film should be seen as an incomplete binary tree of growth rates. How do we compute the estimators and test statistics? According to the above presentation, we should restrict the observation to the largest complete subtree $\mathbb{T}_{r+1}$. We actually take into account all the observations, noting that:

• very few cells are observed in a generation, say, $r$, when generation $r - 1$ is not completely observed,

• cells observed in the last generation are assumed to be the result of a random permutation $\Pi$, independent of $X$; this should be correct as a first approximation.

Figure 5 gives the global empirical distribution of the residues $\hat\varepsilon$ over the 95 films. We have separated the new poles' residues (left) from the old poles' ones (right), and fitted normal distributions to them. Both histograms are close to Gaussian laws.

Fig. 5. Histograms of the residues $\hat\varepsilon_{2n}$ (new poles, left) and $\hat\varepsilon_{2n+1}$ (old poles, right), and their fit to Gaussian distributions.

Fig. 6. $H_0 = \{(\alpha_0, \beta_0) = (\alpha_1, \beta_1)\}$. Histogram of the p-values $P(\chi^2(2) > \chi^{(1)}_{\mathrm{obs}})$.



Figure 6 shows that $H_0$ can be strongly rejected. This indicates that the dynamics of the growth rate of the old pole offspring is different from that of the new pole offspring. The nullity of any parameter ($\alpha_0$, $\beta_0$, $\alpha_1$ or $\beta_1$) can be strongly rejected as well. This highlights the relevance of a Markovian model with memory one: the mother cell is a significant predictor of offspring growth rate in general.

Besides, we cannot reject the hypothesis that both $\alpha$'s are equal on the one hand, and that both $\beta$'s are equal on the other hand. But we strongly reject that both fixed points, namely, $\gamma_0 = \beta_0/(1 - \alpha_0)$ and $\gamma_1 = \beta_1/(1 - \alpha_1)$, are equal; see Figure 7. Hence, the parametrization $(\alpha, \gamma)$, which makes more physical sense than the parametrization $(\alpha, \beta)$, has the following advantage: with no assumption on the $\alpha$'s, we can detect aging by looking only at the $\gamma$'s, which we cannot do with the $\beta$'s. It also means that the new poles and the old poles are not only different in distribution, but also in mean.

The scatter plot in Figure 7 indicates that $\gamma_0 > \gamma_1$. More precisely, the line $\gamma_0 = \gamma_1 + \delta$ fits the data well, with $\delta$ significantly positive. Numerically, $\delta \approx 0.0012 \pm 0.0011$, or $0.0011 \pm 0.0008$ if we delete the two aberrant points in Figure 7 (right). This may be seen as statistical evidence of aging in E. Coli, since, on average, old pole cells grow slower than the new pole cells, which is characteristic of aged individuals. Quantitatively, they seem to grow 3% slower (we may speak in terms of percentage since the range of values of the $\gamma$'s is narrow). This result is close to Stewart et al.'s original calculations, since in [22] they estimated this ratio to be around 2%.

Fig. 7. $H'_0 = \{\beta_0/(1 - \alpha_0) = \beta_1/(1 - \alpha_1)\}$; left: histogram of the p-values $P(\chi^2(1) > \chi^{(3)}_{\mathrm{obs}})$; right: $\hat\beta_0/(1 - \hat\alpha_0)$ on the x-axis, $\hat\beta_1/(1 - \hat\alpha_1)$ on the y-axis; the dashed line is the diagonal.